摘要 |
<p>Optical character recognition is achieved by a system which has a scanner (10) for scanning a document, an edge extractor (11) for identifying edges in the image produced by the scanner to produce an outline of each object identified in the image, a segmentation facility (15) for grouping the object outlines into blocks, means (14) for identifying features of the outlines, and a final classification stage (16) for providing data in an appropriate format representative of the characters in the image. Also disclosed are a novel edge extractor, a novel page segmentation facility and a novel feature extraction facility.</p> |