摘要 |
PROBLEM TO BE SOLVED: To provide a character segmentation device and a character segmentation method, by which characters are segmented correctly without the prerequisite of row extraction even in the case that a plurality of rows are inputted simultaneously. SOLUTION: The character segmentation device is provided with a character string image storage part 11 for storing character string images, a connection component extraction part 12 for binarizing the character string images and extracting connection components, a minimum unit preparation part 13 for judging whether or not they are in contact for all the connection components, turning the connection component itself to a minimum unit at the time of non- contact and cutting the connection component and newly preparing the minimum unit at the time of contact, a two-dimensional connection relation preparation part 14 for obtaining the connection relation of the minimum units with each other, a character segmentation candidate output part 15 for preparing a combination pattern by a plurality of the minimum units in the connection relation with each other and outputting the pattern with the high possibility of being a correct character as a character segmentation candidate, and a character segmentation establishment part 16 for performing a recognition processing, a language knowledge processing and layout analysis on the basis of the character segmentation candidate and outputting a character segmentation result.
|