发明名称 Modifying web pages to reduce retrieval latency
摘要 Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating modified web documents. In one aspect, a method includes retrieving a web document including markup language code and having references to resources external to the web document and parsing the retrieved web document to interpret the markup language code and identify references to resources external to the retrieved web document. Data relating to at least a portion of the resources external to the retrieved web document are retrieved, and a modified web document including the retrieved data is generated and stored for use in responding to a request for retrieval of content of the web document.
申请公布号 US8977653(B1) 申请公布日期 2015.03.10
申请号 US201012818068 申请日期 2010.06.17
申请人 Google Inc. 发明人 Mahkovec Ziga;Kapoor Rupesh
分类号 G06F7/00;G06F17/30 主分类号 G06F7/00
代理机构 Fish & Richardson P.C. 代理人 Fish & Richardson P.C.
主权项 1. A method, comprising: independent of receipt of a search request: retrieving a plurality of web documents, each web document including markup language code and at least one reference to a resource external to the web document;parsing each of the plurality of retrieved web documents, by operation of a computer, to interpret the markup language code and to identify the at least one reference to a resource external to the web document;retrieving referenced data from the resource external to each of the plurality of retrieved web documents;adding, by operation of a computer, to each web document of the plurality of web documents, retrieved referenced data associated with the web document, to generate a plurality of modified web documents, wherein each modified web document contains the retrieved referenced data associated with the modified web document and the interpreted markup language code, and wherein generating a particular modified web document includes: generating a document object model tree based on the retrieved referenced data and at least a portion of the interpreted markup language code from the associated retrieved web document; andgenerating the particular modified web document based on the document object model tree; andstoring each of the plurality of modified web documents for use in responding to a request for retrieval of content for a particular web document; receiving a search request; and returning, in response to the search request, a plurality of search results, each search result that is associated with a retrieved one of the plurality of web documents comprising an image preview of and a link to a particular stored modified web document associated with the retrieved web document, the image preview visually displayed adjacent to the search result and providing a static representation of a visual appearance of the modified web document.
地址 Mountain View CA US