发明名称 Resource Download Policies Based On User Browsing Statistics
摘要 Web crawling polices are generated based on user web browsing statistics. User browsing statistics are aggregated at the granularity of resource identifier patterns (such as URL patterns) that denote groups of resources within a particular domain or website that share syntax at a certain level of granularity. The web crawl policies rank the resource identifier patterns according to their associated aggregated user browsing statistics. A crawl ordering defined by the web crawl polices is used to download and discover new resources within a domain or website.
申请公布号 US2012303606(A1) 申请公布日期 2012.11.29
申请号 US201113114643 申请日期 2011.05.24
申请人 CAI RUI;FAN XIAODONG;ZHANG LEI;MICROSOFT CORPORATION 发明人 CAI RUI;FAN XIAODONG;ZHANG LEI
分类号 G06F17/30;G06F15/173 主分类号 G06F17/30
代理机构 代理人
主权项
地址