摘要 |
A character recognition apparatus comprises an extracting part for extracting a feature of a character within an image data related to a document in a form of a feature vector having components which are histogram values of n degree peripheral pattern describing the feature of the character by distances from an arbitrary side of a character frame to each of first through nth detected contour of the character within the frame, a compression part for compressing the feature vector into a compressed feature vector having components which are quantized data, a dictionary which stores compressed feature vectors of standard characters, and a matching part for matching the compressed feature vector obtained from the compression part with each of the compressed feature vectors stored in the dictionary so as to output at least one candidate character corresponding to one of the standard characters described by a compressed feature vector having a minimum difference with the compressed feature vector obtained from the compression part out of the compressed feature vectors stored in the dictionary.
|