摘要 |
PROBLEM TO BE SOLVED: To achieve an appropriate character row area information even if a document image of a document in which vertical writing and horizontal writing are mingled is provided unconditionally. SOLUTION: A row candidate area which can be assumed to be a character row is extracted (a circumscribed rectangle of connected components of black pixels is obtained and adjoining rectangles are integrated) from an objective document image as a row candidate in a vertical/horizontal direction, if an overlapped area is produced between the extracted vertical/horizontal row candidate areas, the likelihood of character row of each row candidate area is calculated (step S5) as row likelihood (for instance, the feature quantity of the row: row length, row height, aspect ratio of row size, distance between the connected components, connected component size, their fluctuations and the like), according to the result, an inappropriate row candidate in the overlapped area is deleted, and appropriate character row area information is outputted. COPYRIGHT: (C)2004,JPO
|