Title of invention AUTOMATED CLASSIFICATION OF DOCUMENT PAGES
Abstract A system and method are disclosed for automatically classifying images of pages of a source, such as a book, into classifications such as front cover, copyright page, table of contents, text, and index. In one embodiment, the classification process has three phases. During a first phase, a first classifier may be used to determine a preliminary classification of a page image based on single-page criteria. During a second phase, a second classifier may be used to determine a final classification for the page image based on multiple-page and/or global criteria. During an optional third phase, a verifier may be used to verify the final classification of the page image based on verification criteria. If automatic classification fails, the page image may be passed on to a human operator for manual classification.
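The phased process described in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation: the abstract does not state the actual criteria, so every function name, keyword rule, and verification rule below is a hypothetical stand-in, and the sketch assumes that text has already been extracted from each page image.

```python
from typing import List, Optional, Tuple


def first_classifier(page_text: str) -> Optional[str]:
    """Phase 1: preliminary label from single-page criteria only.

    The keyword rules are illustrative assumptions, not the patent's criteria.
    """
    lowered = page_text.strip().lower()
    if "copyright" in lowered or "all rights reserved" in lowered:
        return "copyright page"
    if lowered.startswith("contents") or lowered.startswith("table of contents"):
        return "table of contents"
    if lowered.startswith("index"):
        return "index"
    if lowered:
        return "text"
    return None  # nothing on the page to decide from


def second_classifier(preliminary: List[Optional[str]]) -> List[Optional[str]]:
    """Phase 2: final labels from multiple-page and global criteria."""
    final = list(preliminary)
    # Assumed global criterion: the first page image of a source is its front cover.
    if final:
        final[0] = "front cover"
    # Assumed multiple-page criterion: an undecided page that sits between two
    # pages sharing the same label inherits that label.
    for i in range(1, len(final) - 1):
        if final[i] is None and final[i - 1] is not None and final[i - 1] == final[i + 1]:
            final[i] = final[i - 1]
    return final


def verify(final: List[Optional[str]]) -> List[bool]:
    """Phase 3 (optional): check each final label against verification criteria."""
    toc_positions = [i for i, label in enumerate(final) if label == "table of contents"]
    first_toc = toc_positions[0] if toc_positions else -1
    results = []
    for i, label in enumerate(final):
        if label is None:
            results.append(False)  # no label was ever assigned
        elif label == "index" and i < first_toc:
            results.append(False)  # assumed rule: an index never precedes the contents
        else:
            results.append(True)
    return results


def classify_source(page_texts: List[str]) -> Tuple[List[Optional[str]], List[int]]:
    """Run all phases; return the final labels and the pages needing manual review."""
    preliminary = [first_classifier(text) for text in page_texts]
    final = second_classifier(preliminary)
    verified = verify(final)
    # Pages that fail automatic classification go to a human operator.
    manual_queue = [i for i, ok in enumerate(verified) if not ok]
    return final, manual_queue
```

Calling `classify_source` with a list of per-page strings returns one label per page plus the indices of pages routed to manual classification. Keeping the phases as separate functions mirrors the abstract's split between single-page criteria, multiple-page or global criteria, and verification criteria, so each set of rules can be replaced independently.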
Publication number EP2069980(A1) Publication date 2009.06.17
Application number EP20070814568 Filing date 2007.08.30
Applicant AMAZON TECHNOLOGIES, INC. Inventors BEHM, BRADLEY, JEFFERY;WOOD, BRENT, ERIC
Classification G06F17/30 Main classification G06F17/30
Agency Agent
Main claim
Address