发明名称 SYSTEM AND METHOD FOR AUTOMATING INFORMATION ABSTRACTION PROCESS FOR DOCUMENTS
摘要 A computer-implemented method, a processing pipeline and a system create a hierarchical semantic map of a document and extracted information. The method includes apportioning the document into major sections by accessing the document, recognizing a hierarchical structure of the document, and dividing the document into the major sections by using a data profiler and a machine learning module, classifying the major sections, and mapping the major sections to key elements in one of the multiple levels, searching one major section, and identifying sub-sections from the one major section to achieve a maximum confidence score indicates that the sub-sections associate with the key element, extracting the information from the identified sub-sections by using sequence modelers and linguistic characteristics provided by the data profiler, generating the hierarchical semantic map of the document by using the extracted information, and displaying in a user interface drop down selections of the key elements.
申请公布号 EP3104285(A1) 申请公布日期 2016.12.14
申请号 EP20160173526 申请日期 2016.06.08
申请人 Accenture Global Services Limited 发明人 Shubhashis, Sengupta;Annervaz, Karukapadath Mohamedrasheed;Chakravarthy, Lakshminarasimhan;Manisha, Kapur;Jovin, George;Mansi, Srivastava;Vaidya, Sumanth;Rajeh, Ganesh Natrajan;Siddesha, Swamy
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址