摘要 |
PROBLEM TO BE SOLVED: To recognize a character based on image information read from a description document and a paper document to automatically extract a character recognition result proposal sequence, to automatically extract a keyword proposal based on the character recognition result proposal sequence, to automatically select a keyword from the keyword proposal, based on reliability, significance and a field, and to automatically extract the keyword from the description document and the image information, as to keyword extracting/search system for extracting the keyword from the image information of the document. SOLUTION: This system is provided with a character recognizing part for recognizing the character based on the image information of the original document to generate the character recognition result proposal sequence, a keyword extracting part for extracting, as the keyword proposal, one consistent with by retrieving a word dictionary as to the proposal sequence, or one having a value of a prescribed threshold value or more by finding reliability of the keyword based on the reliability of the individual proposal sequence, and a keyword selecting part for selecting the keyword out of the keyword proposals when the significance corresponding to a position in the original document of the each keyword proposal is the prescribed threshold value or more. COPYRIGHT: (C)2004,JPO
|