Abstract |
<P>PROBLEM TO BE SOLVED: To make layout analysis processing more efficient while improving its accuracy by dividing areas according to the layout conventions specific to each language. <P>SOLUTION: An image processor comprises a first area extraction part that divides document image data into document areas containing characters, using rules that do not depend on the language of the document image data; a classification decision part that determines the language used in the document image data; and a second area extraction part that divides or merges the extracted document areas using rules specific to the determined language, and extracts the resulting document areas. <P>COPYRIGHT: (C)2007,JPO&INPIT |
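The three parts named in the SOLUTION can be illustrated with a minimal sketch. It is not the patented implementation: the `Box` type, the gap thresholds, the `LANGUAGE_RULES` table, the `"ja"`/`"en"` labels, and the aspect-ratio heuristic in `decide_language` are all hypothetical stand-ins chosen for illustration. The sketch assumes character bounding boxes are already available, and it shows only the merging side of the second stage, not division.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Axis-aligned bounding box (hypothetical type for this sketch)."""
    x0: int
    y0: int
    x1: int
    y1: int

    def union(self, other):
        return Box(min(self.x0, other.x0), min(self.y0, other.y0),
                   max(self.x1, other.x1), max(self.y1, other.y1))


def gap(a, b):
    """Horizontal and vertical gap between two boxes (0 if they overlap on that axis)."""
    dx = max(a.x0 - b.x1, b.x0 - a.x1, 0)
    dy = max(a.y0 - b.y1, b.y0 - a.y1, 0)
    return dx, dy


def merge_boxes(boxes, max_dx, max_dy):
    """Merge boxes pairwise until no two boxes lie within both gap thresholds."""
    boxes = list(boxes)
    merged = True
    while merged:
        merged = False
        for i in range(len(boxes)):
            for j in range(i + 1, len(boxes)):
                dx, dy = gap(boxes[i], boxes[j])
                if dx <= max_dx and dy <= max_dy:
                    boxes[i] = boxes[i].union(boxes[j])
                    del boxes[j]
                    merged = True
                    break
            if merged:
                break
    return sorted(boxes, key=lambda b: (b.y0, b.x0))


def extract_areas_first(char_boxes):
    """First area extraction: one language-independent rule (assumed thresholds)."""
    return merge_boxes(char_boxes, max_dx=5, max_dy=5)


def decide_language(areas):
    """Classification decision: toy stand-in that guesses from area shape.

    A real decision part would inspect the image content; here, mostly
    tall areas are labeled "ja" and anything else "en" (assumption only).
    """
    tall = sum(1 for a in areas if (a.y1 - a.y0) > (a.x1 - a.x0))
    return "ja" if tall * 2 > len(areas) else "en"


# Hypothetical language-specific merge thresholds: (max_dx, max_dy).
LANGUAGE_RULES = {
    "ja": (10, 5),   # tolerate wider horizontal gaps between columns
    "en": (5, 10),   # tolerate wider vertical gaps between lines
}


def extract_areas_second(areas, language):
    """Second area extraction: merge areas again using the language's rule."""
    max_dx, max_dy = LANGUAGE_RULES[language]
    return merge_boxes(areas, max_dx, max_dy)


def analyze_layout(char_boxes):
    """Run the three parts in order and return (language, final areas)."""
    areas = extract_areas_first(char_boxes)
    language = decide_language(areas)
    return language, extract_areas_second(areas, language)
```

With two horizontal lines of character boxes 8 units apart, the first stage keeps the lines as separate areas, and the second stage joins them only after the language decision selects the rule that allows the larger vertical gap. Keeping the first stage language-independent means the costlier language decision runs once on coarse areas, not on every character.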