发明名称 |
TABLE RECOGNITION METHOD AND TABLE RECOGNITION DEVICE |
摘要 |
PROBLEM TO BE SOLVED: To determine whether a character string in a table is an item or data while solving ambiguity thereof. SOLUTION: The ambiguity of an item or data is solved by the following steps. (1) An item likelihood of each character string is calculated based on a language pattern and a layout pattern. (2) A word co-occurrence likelihood and a layout pattern co-occurrence likelihood are calculated for each combination of labels of vertically and laterally adjacent character strings. (3) A combination of labels is selected wherein the product of the likelihood by (1) and the likelihoods by (2) is the highest. COPYRIGHT: (C)2009,JPO&INPIT
|
申请公布号 |
JP2009169844(A) |
申请公布日期 |
2009.07.30 |
申请号 |
JP20080009505 |
申请日期 |
2008.01.18 |
申请人 |
HITACHI SOFTWARE ENG CO LTD |
发明人 |
FUJIO MASAKAZU;ONOYAMA TAKASHI;NAKASHIGE AKIRA;MARUKAWA KATSUMI;EISAKI TAKESHI |
分类号 |
G06K9/20 |
主分类号 |
G06K9/20 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|