Abstract |
<p>In a method and system for collecting data from documents present in machine-readable form, at least one already-processed document, stored as a template and designated the template document, is associated with a document to be processed, designated the read document. Fields for the data to be extracted are defined in the template document. Data contained in the read document are automatically extracted from the regions that correspond to the fields in the template document. If an error occurs during the automatic extraction, or if no suitable template document could be associated, the read document is shown on a screen and the fields from which the data are to be extracted are entered manually in the read document. After the manual input of the fields, the read document together with its field specifications is stored as a new template document, or the previous template document is corrected according to the newly entered fields.</p> |
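The control flow summarized in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation. Every name in it (`Template`, `extract`, `find_template`, `process`, the `ask_user` callback) is hypothetical, a document is modeled as a list of text lines, a field as a rectangle of rows and columns, and template matching is reduced to a substring test on the first line. The sketch also assumes that a new template is stored when no template matched, and that the matched template is corrected when extraction failed; the abstract leaves that choice open.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Tuple

# Assumption: a field is a rectangle (top, left, bottom, right) in row/column units.
Region = Tuple[int, int, int, int]
Fields = Dict[str, Region]
Page = List[str]  # assumption: a machine-readable document is a list of text lines


@dataclass
class Template:
    name: str
    anchor: str    # hypothetical matching criterion: text expected on the first line
    fields: Fields


def extract(page: Page, fields: Fields) -> Dict[str, str]:
    """Read the text inside each field region; raise ValueError if a region is empty."""
    data = {}
    for name, (top, left, bottom, right) in fields.items():
        text = " ".join(line[left:right] for line in page[top:bottom]).strip()
        if not text:
            raise ValueError(f"no data found in field {name!r}")
        data[name] = text
    return data


def find_template(page: Page, templates: List[Template]) -> Optional[Template]:
    """Associate a template document with the read document (simplified matching)."""
    for template in templates:
        if page and template.anchor in page[0]:
            return template
    return None


def process(page: Page, templates: List[Template],
            ask_user: Callable[[Page], Fields]) -> Dict[str, str]:
    """Extract automatically if possible; otherwise fall back to manual field input."""
    template = find_template(page, templates)
    if template is not None:
        try:
            return extract(page, template.fields)
        except ValueError:
            pass  # automatic extraction failed: continue with manual input

    # Manual path: the operator marks the fields on the displayed read document.
    fields = ask_user(page)
    if template is None:
        # No suitable template: store the read document's fields as a new template.
        templates.append(Template(f"template-{len(templates) + 1}",
                                  page[0].strip(), fields))
    else:
        # A template matched but failed: correct it with the newly entered fields.
        template.fields = fields
    return extract(page, fields)
```

In this sketch the first document of a given layout goes through `ask_user`, and later documents with the same first line are extracted without operator input until a region comes back empty, at which point the stored template is corrected.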