摘要 |
<P>PROBLEM TO BE SOLVED: To read a character string robustly against a character recognition error and with fewer errors for forms which have obscure character string arrangements, without needing to pre-define a described position and attributes of a reading-target character string for a form group in which various layouts coexist. <P>SOLUTION: A form recognition device performs the steps of: detecting a character string area from a form image (S120); calculating, for the detected character strings, an item name likelihood representing likeness of an item name and an item value likelihood representing likelihood of an item value (S140, S150); calculating, for a character string pair constituted by a combination of the detected character strings, an arrangement likelihood representing validity of the arrangement relationship of the character string pair as an item name-item value relationship (S160); calculating an evaluation value of the item name-item value relationship based on the values of the item name likelihood, item value likelihood and arrangement likelihood (S170); and determining the item name-item value relationship in the form image (S180). <P>COPYRIGHT: (C)2012,JPO&INPIT |