Analysis

HTML analysis modules for readability, forms, tables, and metadata extraction.

6 modules

| Module | Description |
| --- | --- |
| HTML Readability | Analyzes the readability of the content |
| Extract Forms | Extracts form data from HTML |
| Extract Metadata | Extracts metadata from HTML |
| Extract Tables | Extracts table data from HTML |
| Find Patterns | Finds repeated data patterns in HTML |
| HTML Structure | Analyzes the HTML DOM structure |

Modules

HTML Readability

analysis.html.analyze_readability

Analyzes the readability of the content.

Parameters:

| Name | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| html | string | Yes | - | HTML content to analyze |

Output:

| Field | Type | Description |
| --- | --- | --- |
| type | any | object |
| properties | any | |
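The output fields of this module are not documented beyond `type` and `properties`, so the following is only a sketch of the kind of analysis a readability module might perform. It is not the module's implementation. It uses Python's standard `html.parser`; the names `_TextExtractor` and `readability_stats` and the returned keys are hypothetical.

```python
import re
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects visible text, skipping <script> and <style> bodies."""

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth:
            self.chunks.append(data)


def readability_stats(html: str) -> dict:
    """Hypothetical helper: basic word and sentence statistics."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    # Collapse runs of whitespace into single spaces.
    text = " ".join(" ".join(parser.chunks).split())
    words = re.findall(r"[A-Za-z0-9']+", text)
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    avg = len(words) / len(sentences) if sentences else 0.0
    return {
        "word_count": len(words),
        "sentence_count": len(sentences),
        "avg_words_per_sentence": avg,
    }
```

Shorter sentences generally read more easily, which is why the average sentence length is a common first-pass signal; a real readability score would likely combine it with other measures.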

Extract Forms

analysis.html.extract_forms

Extracts form data from HTML.

Parameters:

| Name | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| html | string | Yes | - | HTML content to analyze |

Output:

| Field | Type | Description |
| --- | --- | --- |
| type | any | object |
| properties | any | |
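As a rough illustration of what form extraction involves, the sketch below walks the HTML with Python's standard `html.parser` and records each form's `action`, `method`, and fields. The class name `FormCollector` and the dictionary layout are assumptions made for this example; the module's actual output shape is not documented here.

```python
from html.parser import HTMLParser


class FormCollector(HTMLParser):
    """Hypothetical collector: one dict per <form> with its fields."""

    def __init__(self):
        super().__init__()
        self.forms = []
        self._current = None  # form currently being parsed, if any

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "form":
            self._current = {
                "action": attrs.get("action"),
                # HTML defaults to GET when no method is given.
                "method": (attrs.get("method") or "get").lower(),
                "fields": [],
            }
            self.forms.append(self._current)
        elif tag in ("input", "select", "textarea") and self._current is not None:
            self._current["fields"].append({
                "tag": tag,
                "name": attrs.get("name"),
                "type": attrs.get("type"),
            })

    def handle_endtag(self, tag):
        if tag == "form":
            self._current = None
```

Controls that appear outside any `<form>` element are ignored in this sketch.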

Extract Metadata

analysis.html.extract_metadata

Extracts metadata from HTML.

Parameters:

| Name | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| html | string | Yes | - | HTML content to analyze |

Output:

| Field | Type | Description |
| --- | --- | --- |
| type | any | object |
| properties | any | |
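Page metadata usually lives in the `<title>` element and in `<meta>` tags. The following sketch, which assumes nothing about how the module itself works, gathers both with Python's standard `html.parser`. `MetadataCollector` and its attributes are hypothetical names.

```python
from html.parser import HTMLParser


class MetadataCollector(HTMLParser):
    """Hypothetical collector for <title> text and <meta> name/content pairs."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.meta = {}
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "title":
            self._in_title = True
        elif tag == "meta":
            # Standard tags use name=..., Open Graph tags use property=...
            key = attrs.get("name") or attrs.get("property")
            if key and attrs.get("content") is not None:
                self.meta[key] = attrs["content"]

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data
```

When the same key appears more than once, the last value wins in this sketch.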

Extract Tables

analysis.html.extract_tables

Extracts table data from HTML.

Parameters:

| Name | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| html | string | Yes | - | HTML content to analyze |

Output:

| Field | Type | Description |
| --- | --- | --- |
| type | any | object |
| properties | any | |
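To make the idea of table extraction concrete, here is a minimal sketch that turns each `<table>` into a list of rows, each row being a list of cell strings. It relies on Python's standard `html.parser`, does not handle nested tables, and is not taken from the module; `TableCollector` is a hypothetical name.

```python
from html.parser import HTMLParser


class TableCollector(HTMLParser):
    """Hypothetical collector: tables -> rows -> cell text."""

    def __init__(self):
        super().__init__()
        self.tables = []
        self._row = None   # cells of the row being parsed
        self._cell = None  # text fragments of the cell being parsed

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self.tables.append([])
        elif tag == "tr" and self.tables:
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            # Join fragments and normalize whitespace inside the cell.
            self._row.append(" ".join("".join(self._cell).split()))
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.tables[-1].append(self._row)
            self._row = None

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)
```

Header cells (`<th>`) and data cells (`<td>`) are treated alike, so the first row of a result is typically the header row.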

Find Patterns

analysis.html.find_patterns

Finds repeated data patterns in HTML.

Parameters:

| Name | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| html | string | Yes | - | HTML content to analyze |

Output:

| Field | Type | Description |
| --- | --- | --- |
| type | any | object |
| properties | any | |
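How the module detects repeated patterns is not described on this page. One simple heuristic, shown here purely as an assumption, is to count how often each combination of tag name and class list occurs and report the combinations that repeat. `SignatureCounter`, `repeated_signatures`, and the `min_count` threshold are hypothetical.

```python
from collections import Counter
from html.parser import HTMLParser


class SignatureCounter(HTMLParser):
    """Counts (tag, sorted class list) signatures of start tags."""

    def __init__(self):
        super().__init__()
        self.counts = Counter()

    def handle_starttag(self, tag, attrs):
        classes = dict(attrs).get("class") or ""
        # Sort class names so "a b" and "b a" share one signature.
        signature = (tag, " ".join(sorted(classes.split())))
        self.counts[signature] += 1


def repeated_signatures(html: str, min_count: int = 3) -> dict:
    """Hypothetical helper: signatures seen at least min_count times."""
    parser = SignatureCounter()
    parser.feed(html)
    parser.close()
    return {sig: n for sig, n in parser.counts.items() if n >= min_count}
```

Elements that share a signature, such as product cards or list items, are often the repeated records a scraper wants, although this heuristic ignores where the elements sit in the tree.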

HTML Structure

analysis.html.structure

Analyzes the HTML DOM structure.

Parameters:

| Name | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| html | string | Yes | - | HTML content to analyze |

Output:

| Field | Type | Description |
| --- | --- | --- |
| type | any | object |
| properties | any | |
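A structural analysis typically reports figures such as element counts and nesting depth. The sketch below computes both with Python's standard `html.parser`, assuming reasonably well-formed input in which every non-void element is explicitly closed. It illustrates the idea only; `StructureAnalyzer` and its attributes are hypothetical and unrelated to the module's real output.

```python
from collections import Counter
from html.parser import HTMLParser

# Elements that never have a closing tag in HTML.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class StructureAnalyzer(HTMLParser):
    """Hypothetical analyzer: per-tag counts and maximum nesting depth."""

    def __init__(self):
        super().__init__()
        self.tag_counts = Counter()
        self.max_depth = 0
        self._depth = 0  # number of currently open non-void elements

    def handle_starttag(self, tag, attrs):
        self.tag_counts[tag] += 1
        # The new element sits one level below its open ancestors.
        self.max_depth = max(self.max_depth, self._depth + 1)
        if tag not in VOID_TAGS:
            self._depth += 1

    def handle_endtag(self, tag):
        if tag not in VOID_TAGS and self._depth:
            self._depth -= 1
```

Because `html.parser` is a tokenizer rather than a full DOM builder, documents that omit optional closing tags (for example `</p>` or `</li>`) would report a larger depth than a browser would.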

Released under the Apache 2.0 License.