Title of invention: Creation of inverted index system, and data processing method and apparatus
Abstract: The present disclosure relates to techniques for establishing an inverted indexing system and for related data processing. The techniques may include writing, by a computing device, inverted indexes of a massive number of data records into at least one inverted file. The computing device may then write description information of the written inverted file into a description file associated with the inverted file, and establish the inverted indexing system based on the inverted file and the description file of the inverted file. The techniques enhance efficiency in establishing the inverted indexing system and in processing data using the system.
Publication number: US9256665(B2) Publication date: 2016.02.09
Application number: US201314045613 Filing date: 2013.10.03
Applicant: Alibaba Group Holding Limited Inventor: Qin Jian
Classification: G06F17/30 Main classification: G06F17/30
Agency: Lee & Hayes, PLLC Agent: Lee & Hayes, PLLC
Principal claim: 1. A method for establishing an indexing system, the method comprising: writing, by a computing device including one or more processors, multiple inverted indexes of data records into at least one inverted file; writing, by the computing device, description information of the at least one inverted file to a description file corresponding to the at least one inverted file; and establishing, by the computing device, the inverted indexing system based on the at least one inverted file and the description file corresponding to the at least one inverted file, wherein the establishing the inverted indexing system based on the at least one inverted file and the description file corresponding to the at least one inverted file comprises: selecting an attribute of the data records as a function argument, retrieving a function value based on a predetermined functional mapping relation, combining the at least one inverted file and the description file corresponding to the at least one inverted file that corresponds to multiple data records that have the function value, into an indexing segment partition, and establishing the inverted indexing system based on the indexing segment partition.
Address: Grand Cayman KY
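
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. Every name in it (`build_inverted_index`, `write_segment`, `partition_key`, `build_indexing_system`, `lookup`) is hypothetical, as are the record fields `id`, `seller_id`, and `title`. The sketch assumes that the "predetermined functional mapping relation" is an integer attribute taken modulo a partition count, that the description information is a per-term offset and length into the inverted file, and that both files are held in memory rather than written to disk.

```python
import json
from collections import defaultdict


def build_inverted_index(records, text_field):
    """Map each term to the sorted list of ids of the records containing it."""
    postings = defaultdict(set)
    for rec in records:
        for term in rec[text_field].lower().split():
            postings[term].add(rec["id"])
    return {term: sorted(ids) for term, ids in sorted(postings.items())}


def write_segment(index):
    """Serialize posting lists into one inverted-file byte string.

    Also returns the description information: for each term, the offset and
    length of its posting list inside the inverted file (an assumed layout).
    """
    inverted_file = bytearray()
    description = {}
    for term, ids in index.items():
        payload = json.dumps(ids).encode("utf-8")
        description[term] = {"offset": len(inverted_file), "length": len(payload)}
        inverted_file += payload
    return bytes(inverted_file), description


def partition_key(record, attribute, num_partitions):
    """Assumed mapping relation: integer attribute modulo the partition count."""
    return record[attribute] % num_partitions


def build_indexing_system(records, attribute, text_field, num_partitions):
    """Group records by function value and build one segment partition per value."""
    groups = defaultdict(list)
    for rec in records:
        groups[partition_key(rec, attribute, num_partitions)].append(rec)
    system = {}
    for key, group in groups.items():
        inverted_file, description = write_segment(
            build_inverted_index(group, text_field)
        )
        system[key] = {"inverted_file": inverted_file, "description": description}
    return system


def lookup(system, key, term):
    """Read one posting list using the description info of a single partition."""
    partition = system.get(key)
    if partition is None:
        return []
    entry = partition["description"].get(term)
    if entry is None:
        return []
    start = entry["offset"]
    raw = partition["inverted_file"][start:start + entry["length"]]
    return json.loads(raw.decode("utf-8"))
```

With this layout, a query that already knows the function value (for example, the seller a search is restricted to) reads only the matching partition's description and inverted file rather than scanning every segment.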