Title of invention: Web crawler scheduler that utilizes sitemaps from websites
Abstract: Methods and systems for a web crawler scheduler that utilizes sitemaps from websites are described. A web crawler scheduling system receives a notification from a website or web server. In response to the notification, the system accesses one or more sitemaps for documents associated with the website or web server. The system schedules crawls of the documents based on information identified from the sitemaps. The system crawls at least a subset of the documents scheduled for crawling.
Publication number: US8037054(B2) Publication date: 2011.10.11
Application number: US20100823358 Filing date: 2010.06.25
Applicant: GOOGLE INC. Inventors: BRAWER SASCHA B.;IBEL MAXIMILIAN;KELLER RALPH MICHAEL;SHIVAKUMAR NARAYANAN
Classification: G06F7/00;G06F17/30 Main classification: G06F7/00
Agency: Agent:
Main claim:
Address:
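The flow described in the abstract (notification received, sitemap accessed, crawls scheduled, a subset crawled) can be illustrated with a minimal sketch. This is not the patented method: the record does not say which sitemap information drives scheduling, so ordering by the sitemap `<priority>` value is an assumption made here for illustration. The function names (`parse_sitemap`, `schedule_crawls`, `crawl_subset`, `on_notification`), the `budget` parameter, and the `fetch` callback are all hypothetical, and the sitemap is passed in as a string rather than downloaded.

```python
import heapq
import xml.etree.ElementTree as ET

# XML namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"


def parse_sitemap(xml_text):
    """Return a list of (url, priority) pairs found in a sitemap document."""
    root = ET.fromstring(xml_text)
    entries = []
    for url in root.iter(SITEMAP_NS + "url"):
        loc = url.findtext(SITEMAP_NS + "loc")
        if not loc or not loc.strip():
            continue  # skip entries without a location
        text = url.findtext(SITEMAP_NS + "priority")
        # The sitemaps.org protocol gives 0.5 as the default priority.
        priority = float(text) if text and text.strip() else 0.5
        entries.append((loc.strip(), priority))
    return entries


def schedule_crawls(entries):
    """Build a heap ordered by descending priority (assumed criterion)."""
    # The index breaks ties so equal priorities keep sitemap order.
    heap = [(-priority, i, url) for i, (url, priority) in enumerate(entries)]
    heapq.heapify(heap)
    return heap


def crawl_subset(heap, budget, fetch):
    """Crawl at most `budget` scheduled URLs using the `fetch` callback."""
    crawled = []
    while heap and len(crawled) < budget:
        _, _, url = heapq.heappop(heap)
        fetch(url)
        crawled.append(url)
    return crawled


def on_notification(sitemap_xml, budget, fetch):
    """Handle a notification: parse, schedule, then crawl a subset."""
    return crawl_subset(schedule_crawls(parse_sitemap(sitemap_xml)), budget, fetch)
```

A caller would pass the sitemap text, a crawl budget, and any function that fetches a URL, for example `on_notification(sitemap_xml, 100, my_fetcher)`, where `my_fetcher` is a placeholder for the actual fetch routine. A real scheduler would presumably also weigh fields such as `<lastmod>` and `<changefreq>`; those are left out to keep the sketch short.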