发明名称 Systems and Methods for Extracting Table Information from Documents
摘要 Systems and methods for extracting table information from documents are provided herein. Exemplary methods may include annotating a document with annotations that identify table cell data included therein, generating a candidate table for each of a plurality of table models using the annotated table cell data, scoring each of the candidate tables, selecting a highest scoring candidate table, and annotating the highest scoring table to produce a final table.
申请公布号 US2015026556(A1) 申请公布日期 2015.01.22
申请号 US201313943668 申请日期 2013.07.16
申请人 Stadermann Jan;Symons Stephan;Thon Ingo 发明人 Stadermann Jan;Symons Stephan;Thon Ingo
分类号 G06F17/24 主分类号 G06F17/24
代理机构 代理人
主权项 1. A method, for extracting table information from an unstructured document using a table extraction system that comprises a processor and table extraction logic stored in memory, wherein the processor executes the table extraction logic to perform operations comprising: annotating text of a document with annotations using domain knowledge of the unstructured document to produce annotated table cell data; generating a candidate table for each of a plurality of table models using the annotated table cell data; scoring each of the candidate tables; selecting a highest scoring candidate table; andproviding the highest scoring candidate table.
地址 Rheinbach DE