发明名称 A method and means for enhancing optical character recognition of printed documents.
摘要 A document marker (27) including first values dependent upon the layout and the contents of the document and assigned by generating or preprocessing software (21) is provided in machine-readable symbology on the face of a printed version (24) of the document. The marker (27) may include encoded document layout information and values assigned on sequences of the original text, including text-dependent decimation sequences, error correction codes or check-sums. Upon optical character recognition scanning (16), or other digitizing reproduction, the marker (27) is also scanned. The scanning computer (28), having corresponding software (29,26), assigns second values dependent upon the layout and contents of the reproduced document. Upon comparison of the first and second decimation sequences, line and character errors can be detected and some errors corrected, thereby generating re-aligned candidate sequences. Optional error correction codes can provide further correcting capabilities, as applied to the re-aligned reproduced document sequences, and an optional check-sum comparison can be utilized to verify that the accuracy of the reproduced sequences is correct. <IMAGE>
申请公布号 EP0649112(A3) 申请公布日期 1995.11.02
申请号 EP19940307242 申请日期 1994.10.04
申请人 MATSUSHITA ELECTRIC INDUSTRIAL CO., LTD. 发明人 LOPRESTI, DANIEL P.;SANDBERG, JONATHAN S.
分类号 G06F11/10;G06F17/21;G06F17/22;G06K9/03;G06K9/20;G06T1/00 主分类号 G06F11/10
代理机构 代理人
主权项
地址