发明名称 Methods and apparatus for user-centered web crawling
摘要 Techniques are provided for user-centered search and crawling on an information network such as the world wide web. The techniques identify the nature of the web pages which are most relevant to a given predicate. The behavior of users is used to identify and determine the web pages which are most relevant to a specific crawl. Thus, the techniques are implemented in a web crawling system which can obtain the web pages specific to a given topic by leveraging the nature of the interests of the users in different topics.
申请公布号 US2004205049(A1) 申请公布日期 2004.10.14
申请号 US20030410846 申请日期 2003.04.10
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 AGGARWAL CHARU C.
分类号 G06F17/30;(IPC1-7):G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址