Title of invention: METHOD AND SYSTEM FOR COLLECTING AND RETRIEVING INFORMATION FROM WEB SITES
Abstract: A method and system for collecting and retrieving Web pages is described. One embodiment acquires a set of Web pages; for each Web page in the set of Web pages, analyzes the Web page for data artifacts, classifies each data artifact on the Web page as one of a predetermined set of types, and indexes and organizes, in at least one data structure, each classified data artifact, each indexed and organized data artifact in the at least one data structure being associated with a subject, all indexed and organized data artifacts that are associated with a non-unique subject being associated with a single subject entry; receives a query indicating a particular subject to be searched; retrieves search results from the at least one data structure, the search results including a set of data artifacts associated with the particular subject; and displays at least some of the search results, the displayed data artifacts in the search results being grouped in accordance with their respective types, the displayed data artifacts in the search results within each type being listed in descending order of relevance to the particular subject.
Publication number: US2008147631(A1) Publication date: 2008.06.19
Application number: US20060610936 Filing date: 2006.12.14
Applicant: Inventors: LEFFINGWELL DEAN; MILLER JEREMIE
Classification: G06F17/30 Main classification: G06F17/30
Agency: Agent:
Main claim:
Address:
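
The indexing and retrieval steps described in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation. The names `Artifact`, `ArtifactIndex`, and `ARTIFACT_TYPES` are hypothetical, the three artifact types are placeholders for the "predetermined set of types", and the relevance score is assumed to be computed elsewhere. Merging non-unique subjects by case and whitespace normalization is likewise an assumption made for illustration.

```python
from collections import defaultdict
from dataclasses import dataclass

# Hypothetical stand-in for the "predetermined set of types" in the abstract.
ARTIFACT_TYPES = ("image", "link", "text")


@dataclass(frozen=True)
class Artifact:
    subject: str      # subject the artifact is associated with
    type: str         # one of ARTIFACT_TYPES
    content: str      # the artifact itself (URL, text snippet, ...)
    relevance: float  # assumed to be precomputed by some scoring step


class ArtifactIndex:
    """Maps each subject to a single entry holding its artifacts by type."""

    def __init__(self):
        # normalized subject -> artifact type -> list of artifacts
        self._by_subject = defaultdict(lambda: defaultdict(list))

    @staticmethod
    def _key(subject):
        # Assumption: subjects differing only in case or spacing are the same
        # subject, so they collapse into one subject entry.
        return " ".join(subject.lower().split())

    def add(self, artifact):
        if artifact.type not in ARTIFACT_TYPES:
            raise ValueError(f"unknown artifact type: {artifact.type!r}")
        self._by_subject[self._key(artifact.subject)][artifact.type].append(artifact)

    def search(self, subject):
        """Return artifacts for a subject, grouped by type and sorted by
        descending relevance within each type."""
        entry = self._by_subject.get(self._key(subject))
        if entry is None:
            return {}
        return {
            t: sorted(entry[t], key=lambda a: a.relevance, reverse=True)
            for t in ARTIFACT_TYPES
            if t in entry
        }


index = ArtifactIndex()
index.add(Artifact("Jabber", "link", "http://example.com/a", 0.4))
index.add(Artifact("jabber", "link", "http://example.com/b", 0.9))
index.add(Artifact("Jabber", "text", "Intro paragraph", 0.7))

for artifact_type, artifacts in index.search("JABBER").items():
    print(artifact_type, [a.content for a in artifacts])
```

In this sketch the two differently capitalized "Jabber" subjects share one entry, and the query returns the link group before the text group, with the higher-scoring link listed first.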