发明名称 Index replication in distributed search engines
摘要 Briefly, embodiments of methods or systems to replicate indexes are described. According to an embodiment, a method may include executing instructions by one or more processors to bring about generating a first replication threshold of documents to be replicated at a local computing site and a second replication threshold of document entries to be stored in a posting list at the local computing site.
申请公布号 US9460226(B2) 申请公布日期 2016.10.04
申请号 US201213536551 申请日期 2012.06.28
申请人 Yahoo! Inc. 发明人 Leroy Vincent;Morel Matthieu;Junqueira Flavio
分类号 G06F17/30 主分类号 G06F17/30
代理机构 Berkeley Law & Technology Group, LLP 代理人 Berkeley Law & Technology Group, LLP
主权项 1. A method of executing computer content, code, or instructions stored as memory states in one or more physical memory devices accessible by one or more processors of a computing device, comprising: accessing the content, code, or instructions from the one or more physical memory devices for execution by the one or more processors of the computing device; executing the accessed content, code, or instructions from the one or more physical memory devices of the computing device; andstoring, in at least one memory of the computing device, binary digital signal quantities resulting from having executed the accessed content, code, or instructions on the one or more processors of the computing device, whereinthe storing of the binary digital signal quantities results, at least in part, from the one or more processors of the computing device executing the accessed content, code, or instructions to assign a split parameter to a local computing site, the split parameter to indicate a portion of a set of electronic documents or at one or more remote computing sites to be replicated at the local computing site relative to a portion of the set of electronic documents at the one or more remote computing sites to be replicated into a posting list entry at the local computing site based on a replication budget to identify a capacity to store replicated electronic documents and replicated posting list entries of electronic documents at the local computing site, and wherein the one or more processors of the computing device executing the accessed content, code, or instructions to generate, in response to one or more search terms received at the local computing site, a first replication threshold based on the split parameter and based on one or more partial scores of the portion of the set of electronic documents at the one or more remote computing sites to be replicated at the local computing site relative to the one or more received search terms, wherein the portion of the set of electronic documents at the one or more remote computing sites comprising partial scores greater than the first replication threshold are to be replicated at the local computing site, and wherein the one or more processors of the computing device executing the accessed content, code, or instructions to generate a second replication threshold based on the split parameter and based on one or more partial scores of posting list entries of the portion of the set of electronic documents stored at the one or more remote computing site relative to the one or more received search terms, wherein the posting list entries, comprising partial scores greater than the second replication threshold, are to be replicated at the local computing site and to reference the electronic documents of the posting list entries stored at the one or more remote computing sites.
地址 Sunnyvale CA US