发明名称 |
Systems and Methods for Extracting Table Information from Documents |
摘要 |
Systems and methods for extracting table information from documents are provided herein. Exemplary methods may include annotating a document with annotations that identify table cell data included therein, generating a candidate table for each of a plurality of table models using the annotated table cell data, scoring each of the candidate tables, selecting a highest scoring candidate table, and annotating the highest scoring table to produce a final table. |
申请公布号 |
US2015026556(A1) |
申请公布日期 |
2015.01.22 |
申请号 |
US201313943668 |
申请日期 |
2013.07.16 |
申请人 |
Stadermann Jan;Symons Stephan;Thon Ingo |
发明人 |
Stadermann Jan;Symons Stephan;Thon Ingo |
分类号 |
G06F17/24 |
主分类号 |
G06F17/24 |
代理机构 |
|
代理人 |
|
主权项 |
1. A method, for extracting table information from an unstructured document using a table extraction system that comprises a processor and table extraction logic stored in memory, wherein the processor executes the table extraction logic to perform operations comprising:
annotating text of a document with annotations using domain knowledge of the unstructured document to produce annotated table cell data; generating a candidate table for each of a plurality of table models using the annotated table cell data; scoring each of the candidate tables; selecting a highest scoring candidate table; andproviding the highest scoring candidate table. |
地址 |
Rheinbach DE |