发明名称 BUSINESS DOCUMENT PROCESSOR
摘要 <p>There is provided a technique for removing only a seal impression while keeping character string information when applying OCR to a business document stored in grayscale, even if the character string and the seal impression overlap with each other. The character string that overlaps with the seal impression is extrapolated by matching a character string present near the seal impression against a database. More specifically, first, a seal impression region in a business document inputted in grayscale is removed. Next, character information that is present near the removed seal impression region and of which a portion of the characters is unclear due to the seal impression region is extracted as seal impression related information. Then, an attribute of the extracted seal impression related information is identified, a customer database storing character string candidates containing customer information is referred to, and based on the seal impression related information classified by attribute, the character string that overlaps with the seal impression region and that is thus unclear is extrapolated.</p>
申请公布号 WO2010073540(A1) 申请公布日期 2010.07.01
申请号 WO2009JP06889 申请日期 2009.12.15
申请人 HITACHI SOFTWARE ENGINEERING CO., LTD.;OBA, MITSUHARU 发明人 OBA, MITSUHARU
分类号 G06K9/34;G06F17/30;G06K9/20;G06K9/72 主分类号 G06K9/34
代理机构 代理人
主权项
地址