Doc Handlers

Base Handler

class fuzzy_table_extractor.doc_handlers.BaseHandler(file_path: pathlib.Path)

The Base Handler is an abstract class that defines the interface every other handler implements. The Extractor uses this interface to search for tables in a document.

property dictionary: pandas.DataFrame

All cell pairs ("couples") in the document, as a DataFrame

property tables: List[pandas.DataFrame]

List of all tables in the document, each as a DataFrame

property words: List[str]

List of all words in the document
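
To make the shape of this interface concrete, here is a minimal sketch of a handler exposing the three documented properties. It does not import the library: HandlerInterface is a local stand-in that mirrors the documented signature, and InMemoryHandler is a hypothetical subclass. The reading of a "cell couple" as a header paired with the value beneath it, and the "title" and "content" column names, are assumptions made for illustration, not the library's confirmed layout.

```python
from abc import ABC, abstractmethod
from pathlib import Path
from typing import List

import pandas as pd


class HandlerInterface(ABC):
    """Local stand-in mirroring the documented BaseHandler interface."""

    def __init__(self, file_path: Path) -> None:
        self.file_path = file_path

    @property
    @abstractmethod
    def words(self) -> List[str]: ...

    @property
    @abstractmethod
    def tables(self) -> List[pd.DataFrame]: ...

    @property
    @abstractmethod
    def dictionary(self) -> pd.DataFrame: ...


class InMemoryHandler(HandlerInterface):
    """Hypothetical handler serving pre-parsed tables instead of reading a file."""

    def __init__(self, file_path: Path, tables: List[pd.DataFrame]) -> None:
        super().__init__(file_path)
        self._tables = tables

    @property
    def tables(self) -> List[pd.DataFrame]:
        return self._tables

    @property
    def words(self) -> List[str]:
        # Split every header and every cell into whitespace-separated tokens.
        found: List[str] = []
        for df in self._tables:
            for text in list(df.columns) + df.to_numpy().ravel().tolist():
                found.extend(str(text).split())
        return found

    @property
    def dictionary(self) -> pd.DataFrame:
        # ASSUMPTION: a "cell couple" is a header paired with each value below it;
        # the column names "title" and "content" are illustrative only.
        rows = [
            {"title": str(col), "content": str(val)}
            for df in self._tables
            for col in df.columns
            for val in df[col]
        ]
        return pd.DataFrame(rows, columns=["title", "content"])


handler = InMemoryHandler(
    Path("example.docx"),
    [pd.DataFrame({"Name": ["Ada Lovelace"], "Role": ["Engineer"]})],
)
```

Because the three members are properties, a caller reads them as attributes (handler.words, handler.tables, handler.dictionary) rather than calling them, which matches how the reference above declares them.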