摘要 |
A document marker (27) including first values dependent upon the layout and the contents of the document and assigned by generating or preprocessing software (21) is provided in machine-readable symbology on the face of a printed version (24) of the document. The marker (27) may include encoded document layout information and values assigned on sequences of the original text, including text-dependent decimation sequences, error correction codes or check-sums. Upon optical character recognition scanning (16), or other digitizing reproduction, the marker (27) is also scanned. The scanning computer (28), having corresponding software (29,26), assigns second values dependent upon the layout and contents of the reproduced document. Upon comparison of the first and second decimation sequences, line and character errors can be detected and some errors corrected, thereby generating re-aligned candidate sequences. Optional error correction codes can provide further correcting capabilities, as applied to the re-aligned reproduced document sequences, and an optional check-sum comparison can be utilized to verify that the accuracy of the reproduced sequences is correct. <IMAGE> |