Title of invention DEVICE AND PROGRAM FOR EXTRACTING INFORMATION
Abstract PROBLEM TO BE SOLVED: To provide an information extracting device that extracts information after deciding whether the retrieved information is actually needed. SOLUTION: The information extracting device 1 is equipped with a complete table storage part 20 that is connected to the Internet and stores item information in a table format in which each column corresponds to an item and each piece of item information is associated with its item. On receiving a user's instruction, a retrieval condition extracting part 21 reads the item information out of the complete table storage part 20 row by row and extracts a retrieval condition. A document retrieval part 22 retrieves a web page from the Internet based on the retrieval condition. A table extracting part 24 extracts the table information marked up with HTML table tags from the retrieved web page. An information collation part 25 and a table extraction dictionary part 26 perform a specified operation on the table information to calculate a weighting value for each piece of item information. An extracted table decision part 28 then refers to the weighting values to decide whether the table information extracted from the web page is needed as an object of information extraction. COPYRIGHT: (C)2006, JPO&NCIPI
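The flow described in the abstract (extract tables from a retrieved page, weight them against known item information, then decide whether to keep them) can be illustrated with a minimal sketch. The abstract only speaks of "a specified operation" for the weighting, so everything concrete here is an assumption made for illustration: the names `TableExtractor`, `item_weights`, `is_needed`, and `extract_needed_tables` are hypothetical, the weight of an item is simply the number of table cells that match it exactly (ignoring case), and a table is kept when the summed weights reach an arbitrary threshold. Retrieval of the page itself is left out.

```python
from html.parser import HTMLParser


class TableExtractor(HTMLParser):
    """Collect the cell text of every <table> in a page, row by row."""

    def __init__(self):
        super().__init__()
        self.tables = []   # finished tables: list of rows, each a list of cell strings
        self._stack = []   # tables still open (supports nested tables)
        self._cell = None  # text fragments of the cell being read, or None

    def _flush(self):
        # Store the cell being read, if any, in the current row.
        if self._cell is not None:
            self._stack[-1][-1].append("".join(self._cell).strip())
            self._cell = None

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self._flush()
            self._stack.append([])
        elif tag == "tr" and self._stack:
            self._flush()
            self._stack[-1].append([])
        elif tag in ("td", "th"):
            self._flush()
            if self._stack and self._stack[-1]:
                self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th", "tr"):
            self._flush()
        elif tag == "table":
            self._flush()
            if self._stack:
                self.tables.append(self._stack.pop())


def item_weights(table, item_values):
    """Assumed weighting: count of cells equal to each known item value."""
    cells = [cell.casefold() for row in table for cell in row]
    return {value: cells.count(value.casefold()) for value in item_values}


def is_needed(table, item_values, threshold=2):
    """Assumed decision rule: keep the table if the summed weights reach the threshold."""
    return sum(item_weights(table, item_values).values()) >= threshold


def extract_needed_tables(html, item_values, threshold=2):
    """Return only the tables of a page that pass the assumed decision rule."""
    parser = TableExtractor()
    parser.feed(html)
    parser.close()
    return [t for t in parser.tables if is_needed(t, item_values, threshold)]
```

With a page containing one product table and one navigation table, and item values such as `["X100", "Model"]` taken from an already stored row, only the product table would pass the check. A real system would likely use fuzzier matching and per-item weights than this exact-match count.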
Publication number JP2005352547(A) Publication date 2005.12.22
Application number JP20040169568 Filing date 2004.06.08
Applicant NTT DATA CORP Inventor NAKAJIMA HIROYUKI
Classification G06F17/30;G06F19/00;(IPC1-7):G06F17/30 Main classification G06F17/30
Agency Agent
Principal claim
Address