Title of invention DEVICE AND METHOD FOR CONSTRUCTION OF DATA BASE
Abstract PROBLEM TO BE SOLVED: To automatically extract item definitions from multiple areas enclosed by ruled lines in a document and to automatically construct a database. SOLUTION: The device includes a document analysis part 11 that extracts character string areas enclosed by ruled lines in a document; a character string area accumulation means 121 that accumulates the character strings from each document, grouped by areas whose positions are common across documents; a common character string information storage means 122 that extracts and stores the character string information common to the multiple character strings accumulated for each common area by the means 121; and a non-common character string acquisition means 131 that acquires the character strings other than the common ones. The character strings stored in the means 122 are set as the items of a database, and the data acquired by the means 131 are successively accumulated as the data corresponding to those items.
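The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the ruled-line analysis of part 11 is not modeled, each document is assumed to arrive already reduced to a dict mapping an area position to the string found there, and the "common character string" is approximated by the longest common prefix of the strings in an area. The function name `build_database` and the sample data are hypothetical.

```python
from collections import defaultdict
from os.path import commonprefix


def build_database(documents):
    """Derive item names and records from per-area strings.

    documents: list of dicts, each mapping an area position (assumed
    representation, e.g. a (row, col) tuple) to the string in that area.
    """
    # Accumulate strings per shared area position (role of means 121).
    accumulated = defaultdict(list)
    for doc in documents:
        for position, text in doc.items():
            accumulated[position].append(text)

    # Keep the text common to every document in an area as the item name
    # (role of means 122). Assumption: "common" means longest common prefix.
    items = {}
    for position, texts in accumulated.items():
        if len(texts) != len(documents):
            continue  # area is not present in every document
        common = commonprefix(texts)
        if common.strip(" :"):
            items[position] = common

    # Collect the non-common remainder of each area as data (role of means 131).
    records = []
    for doc in documents:
        record = {}
        for position, common in items.items():
            record[common.strip(" :")] = doc[position][len(common):].strip()
        records.append(record)

    fields = [common.strip(" :") for common in items.values()]
    return fields, records


# Hypothetical sample: two forms with the same layout.
docs = [
    {(0, 0): "Name: Sato", (0, 1): "Dept: Sales"},
    {(0, 0): "Name: Tanaka", (0, 1): "Dept: R&D"},
]
fields, records = build_database(docs)
```

A prefix-based comparison is only one possible reading of "common character string information". It over-matches when values happen to share leading characters, so a real system would likely need a token-level or position-aware comparison.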
Publication No. JP2000250923(A) Publication date 2000.09.14
Application No. JP19990050011 Filing date 1999.02.26
Applicant MATSUSHITA ELECTRIC IND CO LTD Inventor MIYAKE KYOKO
Classification G06F17/30;G06T1/00;(IPC1-7):G06F17/30 Main classification G06F17/30
Agency Agent
Principal claim
Address