发明名称 Index server architecture using tiered and sharded phrase posting lists
摘要 An information retrieval system uses phrases to index, retrieve, organize and describe documents. Phrases are extracted from the document collection. Documents are indexed according to their included phrases, using phrase posting lists. The phrase posting lists are stored in an cluster of index servers. The phrase posting lists can be tiered into groups, and sharded into partitions. Phrases in a query are identified based on possible phrasifications. A query schedule based on the phrases is created from the phrases, and then optimized to reduce query processing and communication costs. The execution of the query schedule is managed to further reduce or eliminate query processing operations at various ones of the index servers.
申请公布号 US7693813(B1) 申请公布日期 2010.04.06
申请号 US20070694780 申请日期 2007.03.30
申请人 GOOGLE INC. 发明人 CAO PEI;EIRON NADAV;MAZUMDAR SOHAM;PATTERSON ANNA;POWER RUSSELL;ZUNGER YONATAN
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址