发明名称 DEVICE, METHOD, AND PROGRAM FOR GATHERING WEB INFORMATION
摘要 <p><P>PROBLEM TO BE SOLVED: To reconstruct semi-structured or unstructured web information such as an HTML file and a PDF file for integrating necessary web information and facilitating classification and search. <P>SOLUTION: An attribute and an attribute value of gathered meta-information are compared with a vocabulary set list to determine whether they correspond or not. Then, a web information integration device refers to an attribute characteristic rule to determine whether they correspond or not. The web information integration device assumes the meta-information coincident in the all of determinations as information related to a product and stores the meta-information and the web information. The web information integration device also stores integration information coincident with the vocabulary set list and a rule coincident with the attribute characteristic rule as meta-metainformation. The meta-metainformation is used by a retrieval means for searching optional meaning from the stored web information. <P>COPYRIGHT: (C)2008,JPO&INPIT</p>
申请公布号 JP2008226204(A) 申请公布日期 2008.09.25
申请号 JP20070067837 申请日期 2007.03.16
申请人 NEC CORP 发明人 HOSONO SHIGERU;MATSUMOTO SHIGEAKI;KITANO TAKATOSHI
分类号 G06F17/30;G06Q10/00;G06Q50/00;G06Q50/10 主分类号 G06F17/30
代理机构 代理人
主权项
地址