发明名称 System and method for automatic page registration and automatic zone detection during forms processing
摘要 A system and method automatically detects user defined zones in a document image of a form, compensating for skew and displacement of the image with respect to a original image of form. The system provides a mechanism to input an image for a form document, such as a scanner. The system processes the image to reduce its resolution and to remove significant skew. The image is presented to the user to define the zones. These zones are areas from which the user desires to extract meaningful data through optical character recognition, such as names, dates, addresses, and items on a invoice form. The system further processes the image to remove horizontal and vertical lines, and to create a number of blocks, representing either text or image data. The lines are removed and the blocks formed by runlength smoothing with various parameters. The blocks form clusters of connected pixels. The blocks are labeled such that any set of connected blocks share a unique identification value. Additional data is collected on the commonly labeled blocks to select those blocks useful to definition of a template. The template is a collection of vectors between the centroids of each of the selected blocks. A second document image for processing is obtained, and similarly processed to minimize, deskew, and identify blocks and vectors therein. The vectors in the second document image are compared with vectors in an user selected template to determine the location of user defined zones in the second document image.
申请公布号 US5822454(A) 申请公布日期 1998.10.13
申请号 US19950419135 申请日期 1995.04.10
申请人 REBUS TECHNOLOGY, INC. 发明人 RANGARAJAN, VIJAYAKUMAR
分类号 G06K9/32;G06K9/20;G06T3/00;(IPC1-7):G06K9/34 主分类号 G06K9/32
代理机构 代理人
主权项
地址