发明名称 MINIMAL IMPACT CRAWLER
摘要 A system and method are provided that provide a minimal impact crawler (144) for searching and retrieving information on a distributed network. A policy engine (116) is provided that receives a request for a specific item and assembles policies for the target site containing information about the specific item. The policies are rules that determine the crawl (144) of a target site (128). The crawler (144) applies the policies to schedule crawls (144) of the target site (128) and stores data retrieved from the crawl (144) into a historical database (104) allowing future requests to be satisfied from the data stored in the database. A scheduling engine is implemented to automatically schedule crawls (144) at the beginning of an auction and at the end of an auction to minimize the number of crawls (144) on an auction site. The crawler (144) employs a plurality of minions (144) to retrieve crawl (144) requests and crawl (144) the target web sites (128) to obtain the necessary data.
申请公布号 WO0150320(A1) 申请公布日期 2001.07.12
申请号 WO2000US35169 申请日期 2000.12.21
申请人 AUCTIONWATCH.COM, INC. 发明人 COUSINS, ROBERT, E.;SLAYTON, MARC, A.;MARGOLIN, BENJAMIN
分类号 G06F17/30;(IPC1-7):G06F17/00 主分类号 G06F17/30
代理机构 代理人
主权项
地址