Title of invention HIDDEN-WEB TABLE INTERPRETATION, CONCEPTULIZATION AND SEMANTIC ANNOTATION
Abstract Indexing hidden web information. First and second web pages, each including data organized in table format, are accessed. The tables from the first and second web pages are compared. Based on the comparison, a determination is made as to which table cells contain category labels and which contain instance data. The category labels from the first web page are compared to the category labels from the second web page. A general structure of the individual tables is inferred from the comparison of the category labels. The general structure is chosen from among standard table templates. Data in two or more web pages organized according to the selected table templates is identified. Data from the two or more web pages is stored by associating the table data from those web pages with one or more of the selected table templates.
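The comparison step in the abstract can be illustrated with a minimal sketch. It is not the patented method. It assumes that two sibling pages share the same table layout, that a cell whose text is identical on both pages is a category label, and that a cell whose text differs is instance data. The function names, the two-column label/value template, and the sample car-listing data are all hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Set, Tuple

Table = List[List[str]]
Cell = Tuple[int, int]


def classify_cells(table_a: Table, table_b: Table) -> Tuple[Set[Cell], Set[Cell]]:
    """Split cell positions into (labels, values) by comparing two sibling tables.

    Assumption: text that stays the same across both pages is a category
    label; text that changes is instance data.
    """
    labels: Set[Cell] = set()
    values: Set[Cell] = set()
    for r, (row_a, row_b) in enumerate(zip(table_a, table_b)):
        for c, (cell_a, cell_b) in enumerate(zip(row_a, row_b)):
            if cell_a.strip().lower() == cell_b.strip().lower():
                labels.add((r, c))
            else:
                values.add((r, c))
    return labels, values


def extract_record(table: Table, labels: Set[Cell], values: Set[Cell]) -> Dict[str, str]:
    """Pair each label with the value cell to its right.

    Assumption: the tables follow a simple "label | value" row template,
    one of many possible standard templates.
    """
    record: Dict[str, str] = {}
    for r, c in sorted(labels):
        if (r, c + 1) in values:
            record[table[r][c].strip()] = table[r][c + 1].strip()
    return record


# Hypothetical tables taken from two sibling pages of the same site.
page_a = [["Make", "Honda"], ["Year", "2004"], ["Price", "$6,500"]]
page_b = [["Make", "Toyota"], ["Year", "2001"], ["Price", "$4,200"]]

labels, values = classify_cells(page_a, page_b)
record_a = extract_record(page_a, labels, values)
record_b = extract_record(page_b, labels, values)
```

A known weakness of this simplification is that an instance value that happens to be identical on both pages would be misread as a label. Comparing more than two sibling pages reduces that risk.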
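Publication number US2010114902(A1) Publication date 2010.05.06
Application number US20090612590 Filing date 2009.11.04
Applicant BRIGHAM YOUNG UNIVERSITY Inventors EMBLEY DAVID W.; LIDDLE STEPHEN W.; TAO CUI
Classification G06F17/30 Main classification G06F17/30
Agency Agent
Principal claim
Address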