发明名称 RELEVANT SEARCH RANKINGS USING HIGH REFRESH-RATE DISTRIBUTED CRAWLING
摘要 <p>A system for maximal gathering of fresh information added to a network such as the Internet and for processing the gathered fresh information. A link server (2) sends a batch of links to check (3) to a crawler (1B). Crawler (1B) them executes its crawling assignment by filtering the encountered content and extracting only that which is new or changed (4). Crawler (IB) then returns this content (4) to at least one data center and any interested web mining application (5). By using the crawlers (1A-E) to filter the data and only return or notify regarding the fresh content, less bandwidth is needed to get the information to the web mining application (5).</p>
申请公布号 WO2001086507(A1) 申请公布日期 2001.11.15
申请号 US2001014701 申请日期 2001.05.08
申请人 发明人
分类号 主分类号
代理机构 代理人
主权项
地址