摘要 |
PROBLEM TO BE SOLVED: To make it possible to properly and efficiently read out an input character string even in the case of an input pattern mixing a pattern such as a noise other than characters and having a size similar to the size of characters in a pattern for correcting a noise due to the low quality of an input system, a double line, an erasing/paint-out, and the like. SOLUTION: Plural character strings to be recognized about characters are previously registered in a word dictionary 9 together with the order of respective characters constituting these character strings, a segmentation processing part 4 segments candidate segments based on the image of character strings written in an inputted paper medium and a character recognition part 5 recognizes characters in each candidate segment by referring to an individual character recognition dictionary 8 including at least respective characters constituting the previously registered character strings and allows candidate segments having a character recognition result more than prescribed matching quantity to correspond to respective characters of plural character strings coincident with the character recognition result. A knowledge processing part 6 extracts a character string candidate consisting of the combination of candidate segments suited to the order relation of plural character strings in each plural character strings, calculates the evaluation value of each character string candidate based on the matching quantity of the extracted character string candidate and judges the character strings written in the paper medium.
|