摘要 |
PROBLEM TO BE SOLVED: To provide a technology of improving the reduction in a character recognition efficiency caused by an OCR incapable of recognizing semantic connections between words and sentences separated by typesetting, and a technology of attaining simultaneous optical character recognition for a plurality of pieces of printed matter. SOLUTION: Regional elements of a sentence, their connection relations and sequential relations are embedded in the sentence as electronic watermark information so that the sequential relations of characters can accurately be grasped before recognition processing to enhance the character recognition efficiency. Further, a regional designation method adopts a relative coordinate reference region to attain the simultaneous optical character recognition for a plurality of pieces of printed matters while accurately grasping the correspondence to the printed matters. COPYRIGHT: (C)2005,JPO&NCIPI
|