发明名称 |
FINDING REPEATED STRUCTURE FOR DATA EXTRACTION FROM DOCUMENT IMAGES |
摘要 |
Methods and system employing the same for finding repeated structure for data extraction from document images are provided. A reference record and one or more reference fields thereof are identified from a document image. One or more candidate fields are generated for each of the reference fields. One or more best candidate records from the candidate fields are selected using a probabilistic model and an optimal record set is determined from the best candidate records.
|
申请公布号 |
US2012201457(A1) |
申请公布日期 |
2012.08.09 |
申请号 |
US201113022877 |
申请日期 |
2011.02.08 |
申请人 |
BART EVGENIY;SARKAR PRATEEK;SAUND ERIC;PALO ALTO RESEARCH CENTER INCORPORATED |
发明人 |
BART EVGENIY;SARKAR PRATEEK;SAUND ERIC |
分类号 |
G06K9/34 |
主分类号 |
G06K9/34 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|