发明名称 |
DEVICE, METHOD, AND PROGRAM FOR GATHERING WEB INFORMATION |
摘要 |
<p><P>PROBLEM TO BE SOLVED: To reconstruct semi-structured or unstructured web information such as an HTML file and a PDF file for integrating necessary web information and facilitating classification and search. <P>SOLUTION: An attribute and an attribute value of gathered meta-information are compared with a vocabulary set list to determine whether they correspond or not. Then, a web information integration device refers to an attribute characteristic rule to determine whether they correspond or not. The web information integration device assumes the meta-information coincident in the all of determinations as information related to a product and stores the meta-information and the web information. The web information integration device also stores integration information coincident with the vocabulary set list and a rule coincident with the attribute characteristic rule as meta-metainformation. The meta-metainformation is used by a retrieval means for searching optional meaning from the stored web information. <P>COPYRIGHT: (C)2008,JPO&INPIT</p> |
申请公布号 |
JP2008226204(A) |
申请公布日期 |
2008.09.25 |
申请号 |
JP20070067837 |
申请日期 |
2007.03.16 |
申请人 |
NEC CORP |
发明人 |
HOSONO SHIGERU;MATSUMOTO SHIGEAKI;KITANO TAKATOSHI |
分类号 |
G06F17/30;G06Q10/00;G06Q50/00;G06Q50/10 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|