发明名称 METHOD AND APPARATUS OF DIVIDING DOMAINS IN DOCUMENT IMAGE
摘要 PROBLEM TO BE SOLVED: To divide domains in a document image by using a layout of multi-columns in either case, clear or irregular. SOLUTION: A method comprises the following steps: for compensating 102 a skew of an input document image 101, then for generating 103 a compressed image, for extracting 104 small domains, for categorizing 106 the small domains in the line direction into candidates for a series of characters and the like, for extracting 107 vacant parts between columns by using successive long white runs from the small domains of candidates for a series of characters, for deciding 108 type of layout of multi-columns as a single column, a plurality of columns or free column, for selecting 109 the vacant parts depending on the type, and for extracting the domain of sentences by integrating 110 the small domains. COPYRIGHT: (C)2004,JPO&NCIPI
申请公布号 JP2004246929(A) 申请公布日期 2004.09.02
申请号 JP20040163074 申请日期 2004.06.01
申请人 RICOH CO LTD 发明人 SAITO TAKASHI
分类号 G06T11/60;(IPC1-7):G06T11/60 主分类号 G06T11/60
代理机构 代理人
主权项
地址