Title of invention GRAMMATICAL PARSING OF DOCUMENT VISUAL STRUCTURES
Abstract <p>A two-dimensional representation of a document is leveraged to extract a hierarchical structure that facilitates recognition of the document. The visual structure is grammatically parsed utilizing two-dimensional adaptations of statistical parsing algorithms. This allows recognition of layout structures (e.g., columns, authors, titles, footnotes, etc.) and the like such that structural components of the document can be accurately interpreted. Additional techniques can also be employed to facilitate document layout recognition. For example, grammatical parsing techniques that utilize machine learning, parse scoring based on image representations, boosting techniques, and/or “fast features” and the like can be employed to facilitate document recognition.</p>
Publication number CA2614177(A1) Publication date 2007.01.11
Application number CA20062614177 Filing date 2006.06.30
Applicant MICROSOFT CORPORATION Inventors VIOLA, PAUL A.; SHILMAN, MICHAEL
Classification G06K9/72 Main classification G06K9/72
Agency Agent
Principal claim
Address