发明名称 ADVANCED BOOK PAGE CLASSIFICATION ENGINE AND INDEX PAGE EXTRACTION
摘要 Embodiments of the present invention relate to classifying pages of an electronic document, such as a scanned book page. An algorithm, such as a constrained conditional random fields algorithm, is applied to the contents of the electronic document to determine the type of page the electronic document is. Page types may include table of contents (TOC), index, table of figures (TOF), bibliography, epilogue, prologue, foreword, glossary, or other types of pages typically found in a book, magazine, or other publication. Once determined, the contents of the page are extracted using the same algorithm, and labeled.
申请公布号 US2009327210(A1) 申请公布日期 2009.12.31
申请号 US20080163639 申请日期 2008.06.27
申请人 MICROSOFT CORPORATION 发明人 LIU ZHEN
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址