发明名称 Information processing apparatus for determining matching language for characters in image
摘要 An information processing apparatus of the present invention selects one language group, then selects one language from the selected language group, and performs OCR processing appropriate for the selected language on characters included in an image. From an obtained OCR processing result, a matching degree indicating a degree of similarity between the recognized characters in the image and the language selected for the OCR processing is calculated. Then, in a case where the matching degree is equal to or smaller than a particular value, a language belonging to a different language group is selected to further perform OCR processing. The efficiency of the OCR processing is improved. The information processing apparatus of the present invention allows improvement in the efficiency of the OCR processing.
申请公布号 US8831364(B2) 申请公布日期 2014.09.09
申请号 US201313757101 申请日期 2013.02.01
申请人 Canon Kabushiki Kaisha 发明人 Kawasaki Hiromasa
分类号 G06K9/72;G06K9/68 主分类号 G06K9/72
代理机构 Fitzpatrick, Cella, Harper & Scinto 代理人 Fitzpatrick, Cella, Harper & Scinto
主权项 1. An information processing apparatus, comprising: an input unit configured to input an image; a first selection unit configured to select one language group from a plurality of language groups, wherein a plurality of languages are classified into the plurality of language groups; a second selection unit configured to select one language belonging to the language group selected by the first selection unit; a character recognition unit configured to perform character recognition appropriate for the language selected by the second selection unit on characters included in the image inputted by the input unit to obtain a character recognition result; a calculation unit configured to, based on the character recognition result obtained by the character recognition unit, calculate a matching degree indicating a degree of similarity between the characters in the image on which the character recognition was performed and the language selected by the second selection unit; and a control unit configure to, in a case where the calculated matching degree is equal to or greater than a first threshold, determine that the characters in the image on which the character recognition was performed are of the language selected by the second selection unit, and output the character recognition result, in a case where the calculated matching degree is equal to or greater than a second threshold and smaller than the first threshold, cause the second selection unit to select a new language from other languages belonging to the selected language group and cause the character recognition unit to perform character recognition based on said selected new language, and in a case where the calculated matching degree is smaller than the second threshold, cause the first selection unit to select a new language group that is different from the selected language group, cause the second selection unit to select a new language from languages belonging to the selected new language group, and cause the character recognition unit to perform character recognition based on said selected new language.
地址 Tokyo JP