Title of Invention: Systems and Methods for Processing Structured Data from a Document Image
Abstract: Optical character recognition systems and methods including the steps of: capturing an image of a document including a set of numbers having a defined mathematical relationship; analyzing the image to determine line segments; analyzing each line segment to determine one or more character segments; analyzing each character segment to determine possible interpretations, each interpretation having an associated predicted probability of being accurate; forming a weighted finite state transducer for each interpretation, wherein the weights are based on the predicted probabilities; combining the weighted finite state transducer for each interpretation into a document model weighted finite state transducer that encodes the defined mathematical relationship; searching the document model weighted finite state transducer for the lowest weight path, which is an interpretation of the document that is most likely to accurately represent the document; and outputting an optical character recognition version of the captured image.
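The selection step described in the abstract can be illustrated with a small sketch. It is not the claimed implementation: instead of building and composing weighted finite state transducers, it enumerates every joint reading by brute force, which is only workable for tiny inputs. The weight of a reading is assumed to be the sum of the negative log probabilities of its characters, so the lowest-weight reading is the most probable one. The field names, the candidate characters, their probabilities, and the relationship subtotal + tax = total are all hypothetical values chosen for illustration.

```python
import itertools
import math


def field_candidates(char_options):
    """Yield (value, weight) for every reading of one numeric field.

    char_options holds, for each character segment, a list of
    (character, probability) interpretations. The weight is the sum of
    -log(p) over the chosen characters, so lower means more probable.
    """
    for combo in itertools.product(*char_options):
        text = "".join(ch for ch, _ in combo)
        weight = sum(-math.log(p) for _, p in combo)
        yield int(text), weight


def best_document(fields, constraint):
    """Return (values, weight) of the lowest-weight joint reading that
    satisfies the constraint, or None if no reading satisfies it."""
    per_field = [list(field_candidates(f)) for f in fields]
    best = None
    for combo in itertools.product(*per_field):
        values = tuple(v for v, _ in combo)
        if not constraint(*values):
            continue  # reading violates the defined mathematical relationship
        weight = sum(w for _, w in combo)
        if best is None or weight < best[1]:
            best = (values, weight)
    return best


# Hypothetical character interpretations for three fields of a receipt.
subtotal = [[("1", 0.6), ("7", 0.4)], [("0", 0.9), ("8", 0.1)]]
tax = [[("5", 0.8), ("6", 0.2)]]
total = [[("7", 0.55), ("1", 0.45)], [("5", 0.7), ("3", 0.3)]]

result = best_document([subtotal, tax, total], lambda a, b, c: a + b == c)
print(result[0])  # (10, 5, 15)
```

Read independently, the most probable values would be 10, 5 and 75, which do not add up. Requiring the relationship to hold makes the search settle on 10, 5 and 15 instead, which is the kind of correction the document model is meant to provide.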
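Publication No.: US2014067631(A1)   Publication Date: 2014.03.06
Application No.: US201314019510   Filing Date: 2013.09.05
Applicant: HELIX SYSTEMS INCORPORATED   Inventors: DHUSE GREG; VANDEVENTER JOSEPH T.
Classification: G06Q40/00   Main Classification: G06Q40/00
Agency:   Agent:
Principal Claim:
Address: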