Title of Invention |
SYSTEM AND METHOD FOR AUTOMATING INFORMATION ABSTRACTION PROCESS FOR DOCUMENTS |
Abstract |
A computer-implemented method, a processing pipeline, and a system create a hierarchical semantic map of a document and of the information extracted from it. The method includes: apportioning the document into major sections by accessing the document, recognizing its hierarchical structure, and dividing it into the major sections using a data profiler and a machine learning module; classifying the major sections and mapping them to key elements in one of multiple levels; searching one major section and identifying sub-sections of it that achieve a maximum confidence score, the score indicating that the sub-sections are associated with the key element; extracting the information from the identified sub-sections using sequence modelers and linguistic characteristics provided by the data profiler; generating the hierarchical semantic map of the document from the extracted information; and displaying drop-down selections of the key elements in a user interface. |
Publication Number |
EP3104285(A1) |
Publication Date |
2016.12.14 |
Application Number |
EP20160173526 |
Filing Date |
2016.06.08 |
Applicant |
Accenture Global Services Limited |
Inventors |
Shubhashis, Sengupta;Annervaz, Karukapadath Mohamedrasheed;Chakravarthy, Lakshminarasimhan;Manisha, Kapur;Jovin, George;Mansi, Srivastava;Vaidya, Sumanth;Rajeh, Ganesh Natrajan;Siddesha, Swamy |
Classification |
G06F17/30 |
Main Classification |
G06F17/30 |
Agency |
|
Agent |
|
Main Claim |
|
Address |
|
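The sub-section search described in the abstract (identifying the sub-sections of one major section that achieve a maximum confidence score for a key element) can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration: `keyword_confidence` is a toy keyword-overlap score standing in for the machine learning module, and `best_subsections` is a hypothetical exhaustive search over contiguous paragraph spans. The record does not specify how the score is computed or how candidate sub-sections are enumerated.

```python
from typing import Callable, List, Set, Tuple


def keyword_confidence(text: str, keywords: Set[str]) -> float:
    """Toy stand-in for a trained classifier (an assumption, not the patented model).

    Returns the fraction of words in `text` that appear in `keywords`.
    """
    words = text.lower().split()
    if not words:
        return 0.0
    hits = sum(1 for w in words if w.strip(".,;:") in keywords)
    return hits / len(words)


def best_subsections(
    paragraphs: List[str],
    score: Callable[[str], float],
) -> Tuple[int, int, float]:
    """Find the contiguous span of paragraphs with the highest confidence.

    Returns (start, end, confidence) with `end` exclusive. Every contiguous
    span is scored, so the cost is quadratic in the number of paragraphs.
    """
    best = (0, 0, 0.0)
    for start in range(len(paragraphs)):
        for end in range(start + 1, len(paragraphs) + 1):
            conf = score(" ".join(paragraphs[start:end]))
            if conf > best[2]:
                best = (start, end, conf)
    return best
```

A caller would bind the key element's vocabulary into the scorer, for example `best_subsections(paragraphs, lambda t: keyword_confidence(t, {"rent", "pay", "due"}))`, and then pass the winning span to the extraction step.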