发明名称 AVOIDING MASKED WEB PAGE CONTENT INDEXING ERRORS FOR SEARCH ENGINES
摘要 Multiple non-host client sites provide cached user copies of web pages and/or web content, or summaries thereof, to a server. Obtaining data from non-host sources for indexing purposes avoids masked web page content indexing errors for search engines. The server aggregates, summarizes and indexes the web pages and/or web content in an index of cached content, in conjunction with updating, generating and storing a search index using an indexing agent such as a web crawler or spider. In response to receiving search requests from end users, the search engine uses comparisons between the index of cached content and the index of crawled content to identify potential page masking errors for specific search results and appropriately rank or omit results with a high risk of masking errors in a search result list.
申请公布号 US2009265342(A1) 申请公布日期 2009.10.22
申请号 US20090425269 申请日期 2009.04.16
申请人 SHUSTER GARY STEPHEN 发明人 SHUSTER GARY STEPHEN
分类号 G06F17/30;G06F12/08 主分类号 G06F17/30
代理机构 代理人
主权项
地址