Title of Invention TABLE RECOGNITION METHOD AND TABLE RECOGNITION DEVICE
Abstract PROBLEM TO BE SOLVED: To determine whether each character string in a table is an item or data, resolving the ambiguity between the two. SOLUTION: The item/data ambiguity is resolved by the following steps. (1) An item likelihood is calculated for each character string based on a language pattern and a layout pattern. (2) A word co-occurrence likelihood and a layout pattern co-occurrence likelihood are calculated for each combination of labels of vertically and horizontally adjacent character strings. (3) The combination of labels that maximizes the product of the likelihood from (1) and the likelihoods from (2) is selected. COPYRIGHT: (C)2009,JPO&INPIT
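The three steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names, the digit-based item likelihood, and the numeric co-occurrence tables are all hypothetical stand-ins chosen for illustration, and the exhaustive search over label combinations is only practical for very small tables.

```python
from itertools import product

ITEM, DATA = "item", "data"

def item_likelihood(text):
    """Step (1), hypothetical: likelihood that `text` is an item.

    Assumption for illustration only: strings containing digits look
    like data, everything else looks like an item.
    """
    return 0.2 if any(ch.isdigit() for ch in text) else 0.8

def unary(text, label):
    """Likelihood of `label` for one character string taken in isolation."""
    p = item_likelihood(text)
    return p if label == ITEM else 1.0 - p

# Step (2), hypothetical values: each entry stands in for the combined word
# co-occurrence and layout pattern co-occurrence likelihood of a label pair.
# Keys are (label of upper cell, label of lower cell) for vertical neighbours
# and (label of left cell, label of right cell) for horizontal neighbours.
PAIR_VERTICAL = {(ITEM, ITEM): 0.2, (ITEM, DATA): 0.5,
                 (DATA, ITEM): 0.05, (DATA, DATA): 0.25}
PAIR_HORIZONTAL = {(ITEM, ITEM): 0.4, (ITEM, DATA): 0.15,
                   (DATA, ITEM): 0.05, (DATA, DATA): 0.4}

def best_labeling(grid):
    """Step (3): pick the label combination with the highest product."""
    rows, cols = len(grid), len(grid[0])
    cells = [(r, c) for r in range(rows) for c in range(cols)]
    best_score, best_labels = -1.0, None
    # Brute force over all 2**(rows*cols) label combinations.
    for combo in product((ITEM, DATA), repeat=len(cells)):
        labels = dict(zip(cells, combo))
        score = 1.0
        for r, c in cells:
            score *= unary(grid[r][c], labels[(r, c)])
            if r + 1 < rows:  # vertically adjacent pair
                score *= PAIR_VERTICAL[(labels[(r, c)], labels[(r + 1, c)])]
            if c + 1 < cols:  # horizontally adjacent pair
                score *= PAIR_HORIZONTAL[(labels[(r, c)], labels[(r, c + 1)])]
        if score > best_score:
            best_score, best_labels = score, labels
    table = [[best_labels[(r, c)] for c in range(cols)] for r in range(rows)]
    return table, best_score

if __name__ == "__main__":
    grid = [["Name", "Age"],
            ["Sato", "31"]]
    labels, score = best_labeling(grid)
    print(labels, score)
```

In this toy table, "Sato" on its own scores as more likely an item than data, yet the co-occurrence terms with its neighbours make the data label win in the joint product, which is the kind of ambiguity the abstract describes resolving.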
Publication No. JP2009169844(A) Publication Date 2009.07.30
Application No. JP20080009505 Filing Date 2008.01.18
Applicant HITACHI SOFTWARE ENG CO LTD Inventors FUJIO MASAKAZU;ONOYAMA TAKASHI;NAKASHIGE AKIRA;MARUKAWA KATSUMI;EISAKI TAKESHI
Classification No. G06K9/20 Main Classification No. G06K9/20
Agency Agent
Principal Claim
Address