摘要 |
<p>A method of and system for identifying a target form for increased efficiency in an automated data capture process. Forms (10) are scanned (11) and stored (12) as digitized images (22). Regions are defined on the form relative to corresponding reference points between the form and the digitized image. The regions are defined in areas that contain anticipated digitized data from data fields of the form. Digitized data is recognized (15) through such mechanisms as optical character recognition (OCR) and the resulting string variable or identifier (23) is compared (16) in form to a plurality of formats expected (17) for that data. Scoring systems (18) are used to attain a resultant score for a number of string variables which is compared to a predetermined confidence number or threshold. If the confidence number is reached, the form is identified as a target form and used in the data capture process. A first step identification of certain graphical features (72) can be added as an initial determination (275) as to the source of the form.</p> |