Title of Invention FINDING REPEATED STRUCTURE FOR DATA EXTRACTION FROM DOCUMENT IMAGES
Abstract Methods, and systems employing the same, for finding repeated structure for data extraction from document images are provided. A reference record and one or more reference fields thereof are identified from a document image. One or more candidate fields are generated for each of the reference fields. One or more best candidate records are selected from the candidate fields using a probabilistic model, and an optimal record set is determined from the best candidate records.
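The four steps named in the abstract (reference fields, candidate fields, best candidate records, optimal record set) can be illustrated with a toy sketch. This is not the method claimed in the application, whose details are not given in this record. Everything below is an assumption made for illustration: fields are modeled as axis-aligned boxes, candidates are found by horizontal alignment, the probabilistic model is a simple Gaussian penalty on inconsistent vertical offsets, and the optimal set is found by weighted interval scheduling over vertical extents. All names (`Box`, `candidate_fields`, `log_score`, `best_candidate_records`, `optimal_record_set`) and all thresholds are hypothetical.

```python
from bisect import bisect_right
from itertools import product
from typing import NamedTuple


class Box(NamedTuple):
    """Hypothetical field representation: top-left corner plus size."""
    x: float
    y: float
    w: float
    h: float


def candidate_fields(ref: Box, page: list[Box], x_tol: float = 5.0) -> list[Box]:
    """Assumed rule: a candidate is any other box horizontally aligned with `ref`."""
    return [b for b in page if b != ref and abs(b.x - ref.x) <= x_tol]


def log_score(ref_fields: list[Box], record: tuple, sigma: float = 3.0) -> float:
    """Gaussian log-likelihood (up to a constant) that `record` is the
    reference record shifted vertically by one common offset."""
    dys = [c.y - r.y for r, c in zip(ref_fields, record)]
    mean_dy = sum(dys) / len(dys)
    return -sum((dy - mean_dy) ** 2 for dy in dys) / (2 * sigma ** 2)


def best_candidate_records(ref_fields, page, min_score=-1.0):
    """Enumerate one candidate per reference field; keep well-scoring combinations."""
    pools = [candidate_fields(r, page) for r in ref_fields]
    scored = []
    for combo in product(*pools):
        s = log_score(ref_fields, combo)
        if s >= min_score:
            scored.append((s, combo))
    return scored


def optimal_record_set(scored, bonus=1.0):
    """Pick vertically non-overlapping records with maximum total weight
    (weighted interval scheduling; weight = bonus + log score)."""
    items = sorted(
        ((min(b.y for b in rec), max(b.y + b.h for b in rec), bonus + s, rec)
         for s, rec in scored),
        key=lambda t: t[1],  # sort by bottom edge
    )
    bottoms = [it[1] for it in items]
    best = [(0.0, [])]  # best[i]: optimum over the first i items
    for i, (top, _bottom, weight, rec) in enumerate(items):
        j = bisect_right(bottoms, top, 0, i)  # earlier items ending at or above `top`
        take = best[j][0] + weight
        best.append((take, best[j][1] + [rec]) if take > best[i][0] else best[i])
    return best[-1][1]


# Toy page: a two-field reference row at y=0, repeated at y=20 and y≈40.
ref_fields = [Box(0, 0, 50, 10), Box(100, 0, 50, 10)]
page = ref_fields + [Box(0, 20, 50, 10), Box(100, 20, 50, 10),
                     Box(0, 40, 50, 10), Box(100, 41, 50, 10)]
records = optimal_record_set(best_candidate_records(ref_fields, page))
```

On this toy page the sketch keeps the two rows that repeat the reference layout and rejects the mixed-row combinations, whose vertical offsets disagree. Brute-force enumeration with `itertools.product` is only practical for small inputs.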
Publication Number US2012201457(A1) Publication Date 2012.08.09
Application Number US201113022877 Filing Date 2011.02.08
Applicants BART EVGENIY; SARKAR PRATEEK; SAUND ERIC; PALO ALTO RESEARCH CENTER INCORPORATED Inventors BART EVGENIY; SARKAR PRATEEK; SAUND ERIC
Classification G06K9/34 Main Classification G06K9/34
Patent Agency Agent
Principal Claim
Address