kgdata.wikipedia.datasets.easy_tables#

Functions

easy_tables()

Tables that can be labeled automatically easily.

get_n_headers(tbl)

is_easy_table(tbl, tests)

Determine if a table is easy or not.

Classes

easy_tables() Dataset[LinkedHTMLTable][source]#

Tables that can be labeled automatically easily. The table is easy or not is determined by kgdata.wikipedia.easy_table.is_easy_table()

Return type:

Dataset[LinkedHTMLTable]

is_easy_table(tbl: LinkedHTMLTable, tests: List[Callable[[LinkedHTMLTable], bool]]) bool[source]#

Determine if a table is easy or not.

Parameters:
Return type:

bool

get_n_headers(tbl: LinkedHTMLTable) int[source]#
Parameters:

tbl (LinkedHTMLTable) –

Return type:

int

class EasyTests[source]#

Bases: object

MIN_ROWS = 10#
static min_rows(tbl: LinkedHTMLTable) bool[source]#

Determine if a table has at least min_rows rows.

Parameters:
  • tbl (LinkedHTMLTable) – Input table.

  • min_rows – Minimum number of rows.

Return type:

bool

Parameters:

tbl (LinkedHTMLTable) –

Return type:

bool

Parameters:

tbl (LinkedHTMLTable) –

Return type:

bool

Parameters:

tbl (LinkedHTMLTable) –

Return type:

bool

Parameters:

tbl (LinkedHTMLTable) –

Return type:

bool