发明名称 SYSTEMS AND METHODS FOR AUTOMATICALLY EXTRACTING DATA FROM ELETRONIC DOCUMENTS USING MULTIPLE CHARACTER RECOGNITION ENGINES
摘要 In a document analysis system that receives and processes jobs from a plurality of users, in which each job may contain multiple electronic documents, to extract data from the electronic documents, a method of automatically extracting data from each received electronic document using a plurality of character recognition engines is provided. The method includes: automatically processing each received electronic document page using each of a plurality of recognition engines to extract data; comparing quality of data extracted from each of the recognition engines to assign a confidence score to the extracted data; and selecting extracted data having highest confidence score as the correct extracted data.
申请公布号 US2011255784(A1) 申请公布日期 2011.10.20
申请号 US201113007434 申请日期 2011.01.14
申请人 COPANION, INC. 发明人 WELLING GIRISH;SINGH VARTIKA;KRISHNA GOPAL;MAHATA TUSHAR;SARKAR NIRUPAM;NEOGI DEPANKAR;LADD STEVEN K.
分类号 G06K9/18 主分类号 G06K9/18
代理机构 代理人
主权项
地址