Abstract |
PROBLEM TO BE SOLVED: To retrieve the original computerized document from a scanned paper document by performing a concept vector retrieval and detecting similar documents, so that detection works regardless of the reading order of the document blocks and detection speed is improved. SOLUTION: This system comprises a document storage means for storing the original computerized document; a document reading means for reading, as image data, a document printed from the original computerized document; a document recognition means for recognizing the read image data as character codes; a document analysis means for analyzing the recognized document data to extract layout information of the document; a document detection means for performing a concept vector retrieval on the analyzed document data to detect a similar document among the stored original computerized documents; and a detection result output means for outputting the detected result. COPYRIGHT: (C)2005,JPO&NCIPI
|
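
The abstract does not say how the concept vectors are built or compared, so the following is only a minimal sketch of the detection step under stated assumptions. It uses plain term-frequency vectors and cosine similarity as a stand-in for the concept vector retrieval. The names `concept_vector`, `cosine`, and `find_similar` are hypothetical. Because the vector sums terms over all blocks, the order in which the blocks were read does not change the result.

```python
import math
from collections import Counter


def concept_vector(blocks):
    """Build one term-frequency vector from all text blocks of a document.

    Assumption: a bag-of-words count stands in for the "concept vector";
    the abstract does not specify the actual representation.
    """
    vec = Counter()
    for block in blocks:
        vec.update(block.lower().split())
    return vec


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(count * b[term] for term, count in a.items() if term in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def find_similar(scanned_blocks, stored_docs):
    """Rank stored documents by similarity to the scanned document.

    stored_docs maps a document id to its list of text blocks.
    Returns (score, doc_id) pairs, best match first.
    """
    query = concept_vector(scanned_blocks)
    scored = [
        (cosine(query, concept_vector(blocks)), doc_id)
        for doc_id, blocks in stored_docs.items()
    ]
    return sorted(scored, reverse=True)
```

In this sketch, calling `find_similar` with the recognized blocks of a scan and a dictionary of stored documents returns the candidates ranked by score. A real system would likely precompute the stored vectors once rather than rebuilding them per query.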