Abstract |
A character string extraction apparatus comprises: a connected component (CC) detector for detecting, in a binary image, connected components of black pixels; a character-sized connected component (CharCC) extraction unit for extracting, from the detected connected components, those having a size appropriate for a character; a horizontal extension unit and a vertical extension unit for extending the extracted character-sized connected components in an assumed character string direction and for reducing them in the direction perpendicular to the assumed character string direction; long connected component (LongCC) extraction units for connecting a plurality of the thus obtained connected components in the assumed character string direction and for extracting a long connected component; and a character string selector for employing the extracted long connected component to determine a character string for image recognition.
|
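
The stages named in the abstract can be illustrated with a minimal sketch for the horizontal case only. It is not the claimed implementation: the 8-connectivity, the bounding-box representation, the function names, and every threshold (`min_size`, `max_size`, `extend`, `shrink`, `min_length`) are assumptions chosen for illustration, since the abstract does not specify them.

```python
from collections import deque

def connected_components(img):
    """Bounding boxes (top, left, bottom, right), inclusive, of the
    8-connected regions of black pixels (value 1) in a binary image."""
    h, w = len(img), len(img[0])
    seen = [[False] * w for _ in range(h)]
    boxes = []
    for y in range(h):
        for x in range(w):
            if img[y][x] != 1 or seen[y][x]:
                continue
            seen[y][x] = True
            queue = deque([(y, x)])
            t, l, b, r = y, x, y, x
            while queue:
                cy, cx = queue.popleft()
                t, l, b, r = min(t, cy), min(l, cx), max(b, cy), max(r, cx)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = cy + dy, cx + dx
                        if (0 <= ny < h and 0 <= nx < w
                                and img[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            boxes.append((t, l, b, r))
    return boxes

def char_sized(boxes, min_size=2, max_size=5):
    """Keep boxes whose height and width both look character-sized
    (assumed thresholds, in pixels)."""
    return [(t, l, b, r) for t, l, b, r in boxes
            if min_size <= b - t + 1 <= max_size
            and min_size <= r - l + 1 <= max_size]

def extend_horizontal(boxes, shape, extend=2, shrink=1):
    """Paint each box widened by `extend` columns per side (the assumed
    string direction) and narrowed by `shrink` rows per side."""
    h, w = shape
    out = [[0] * w for _ in range(h)]
    for t, l, b, r in boxes:
        t2, b2 = t + shrink, b - shrink
        if t2 > b2:  # always keep at least the middle row
            t2 = b2 = (t + b) // 2
        for y in range(t2, b2 + 1):
            for x in range(max(0, l - extend), min(w - 1, r + extend) + 1):
                out[y][x] = 1
    return out

def long_components(img, min_length=10):
    """Extended boxes that touch merge into one component; keep the
    components at least `min_length` pixels wide."""
    return [(t, l, b, r) for t, l, b, r in connected_components(img)
            if r - l + 1 >= min_length]

def extract_horizontal_strings(img):
    """Run the stages in order and return candidate string boxes."""
    chars = char_sized(connected_components(img))
    merged = extend_horizontal(chars, (len(img), len(img[0])))
    return long_components(merged)
```

Under these assumptions, three 3x3 glyph blobs separated by 2-pixel gaps merge into one long component, while an oversized blob or a single-pixel speck is discarded at the size filter. The vertical case is symmetric and could be obtained by transposing the image before calling `extract_horizontal_strings`; a selector would then choose among the horizontal and vertical candidates.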