Skip to content

Analysis

HTML analysis modules for readability, forms, tables, and metadata extraction.

6 modules

ModuleDescription
Legibilidade HTMLAnalisar legibilidade do conteudo
Extrair FormulariosExtrair dados de formulario do HTML
Extrair MetadadosExtrair metadados do HTML
Extrair TabelasExtrair dados de tabela do HTML
Encontrar PadroesEncontrar padroes de dados repetitivos no HTML
Estrutura HTMLAnalisar estrutura DOM do HTML

Modules

Legibilidade HTML

analysis.html.analyze_readability

Analisar legibilidade do conteudo

Parameters:

NameTypeRequiredDefaultDescription
htmlstringYes-HTML content to analyze

Output:

FieldTypeDescription
typeanyobject
propertiesany

Extrair Formularios

analysis.html.extract_forms

Extrair dados de formulario do HTML

Parameters:

NameTypeRequiredDefaultDescription
htmlstringYes-HTML content to analyze

Output:

FieldTypeDescription
typeanyobject
propertiesany

Extrair Metadados

analysis.html.extract_metadata

Extrair metadados do HTML

Parameters:

NameTypeRequiredDefaultDescription
htmlstringYes-HTML content to analyze

Output:

FieldTypeDescription
typeanyobject
propertiesany

Extrair Tabelas

analysis.html.extract_tables

Extrair dados de tabela do HTML

Parameters:

NameTypeRequiredDefaultDescription
htmlstringYes-HTML content to analyze

Output:

FieldTypeDescription
typeanyobject
propertiesany

Encontrar Padroes

analysis.html.find_patterns

Encontrar padroes de dados repetitivos no HTML

Parameters:

NameTypeRequiredDefaultDescription
htmlstringYes-HTML content to analyze

Output:

FieldTypeDescription
typeanyobject
propertiesany

Estrutura HTML

analysis.html.structure

Analisar estrutura DOM do HTML

Parameters:

NameTypeRequiredDefaultDescription
htmlstringYes-HTML content to analyze

Output:

FieldTypeDescription
typeanyobject
propertiesany

Released under the Apache 2.0 License.