Skip to content

Analysis

HTML analysis modules for readability, forms, tables, and metadata extraction.

6 modules

ModuleDescription
Czytelnosc HTMLAnalizuj czytelnosc tresci
Wyodrebnij formularzeWyodrebnij dane formularzy z HTML
Wyodrebnij metadaneWyodrebnij metadane z HTML
Wyodrebnij tabeleWyodrebnij dane tabel z HTML
Znajdz wzorceZnajdz powtarzajace sie wzorce danych w HTML
Struktura HTMLAnalizuj strukture DOM HTML

Modules

Czytelnosc HTML

analysis.html.analyze_readability

Analizuj czytelnosc tresci

Parameters:

NameTypeRequiredDefaultDescription
htmlstringYes-HTML content to analyze

Output:

FieldTypeDescription
typeanyobject
propertiesany

Wyodrebnij formularze

analysis.html.extract_forms

Wyodrebnij dane formularzy z HTML

Parameters:

NameTypeRequiredDefaultDescription
htmlstringYes-HTML content to analyze

Output:

FieldTypeDescription
typeanyobject
propertiesany

Wyodrebnij metadane

analysis.html.extract_metadata

Wyodrebnij metadane z HTML

Parameters:

NameTypeRequiredDefaultDescription
htmlstringYes-HTML content to analyze

Output:

FieldTypeDescription
typeanyobject
propertiesany

Wyodrebnij tabele

analysis.html.extract_tables

Wyodrebnij dane tabel z HTML

Parameters:

NameTypeRequiredDefaultDescription
htmlstringYes-HTML content to analyze

Output:

FieldTypeDescription
typeanyobject
propertiesany

Znajdz wzorce

analysis.html.find_patterns

Znajdz powtarzajace sie wzorce danych w HTML

Parameters:

NameTypeRequiredDefaultDescription
htmlstringYes-HTML content to analyze

Output:

FieldTypeDescription
typeanyobject
propertiesany

Struktura HTML

analysis.html.structure

Analizuj strukture DOM HTML

Parameters:

NameTypeRequiredDefaultDescription
htmlstringYes-HTML content to analyze

Output:

FieldTypeDescription
typeanyobject
propertiesany

Released under the Apache 2.0 License.