发明名称 |
MODEL-BASED METHOD OF DOCUMENT LOGICAL STRUCTURE RECOGNITION IN OCR SYSTEMS |
摘要 |
In one embodiment, the invention provides a method for determining a logical structure of a document. The method comprises generating at least one document hypothesis for the whole document; for each document hypothesis, verifying said document hypothesis including (a) generating at least one block hypothesis for each block in the document based on the document hypothesis; and (b) selecting a best block hypothesis for each block; selecting as a best document hypothesis the document hypothesis that has the best degree of correspondence with the selected best block hypotheses for the document; and forming the document based on the best document hypothesis.
|
申请公布号 |
US2009087094(A1) |
申请公布日期 |
2009.04.02 |
申请号 |
US20080236054 |
申请日期 |
2008.09.23 |
申请人 |
DERYAGIN DMITRY;ANISMOVICH KONSTANTIN |
发明人 |
DERYAGIN DMITRY;ANISMOVICH KONSTANTIN |
分类号 |
G06K9/34 |
主分类号 |
G06K9/34 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|