发明名称 Method and apparatus for processing database data in distributed database system
摘要 A method and apparatus for processing database data in a distributed database system, wherein the distributed database system comprises a plurality of computing nodes communicatively coupled via computer networks, the method comprising: creating a plurality of different data replicas wherein each of the data replicas is created in the following way: sorting the database data according to at least one data attribute; generating a row key based on the at least one data attribute; and using the sorted database data with the row key as the data replica, storing different data replicas in different computing nodes; and creating an index for each of the data replicas according to its row key.
申请公布号 US9411867(B2) 申请公布日期 2016.08.09
申请号 US201313929283 申请日期 2013.06.27
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 Chen Guan Cheng;Shi Ju Wei;Wang Kun;Yang Ben Bo;Zou Jia
分类号 G06F7/00;G06F17/00;G06F17/30 主分类号 G06F7/00
代理机构 Scully, Scott, Murphy & Presser, P.C. 代理人 Scully, Scott, Murphy & Presser, P.C. ;Davis, Esq. Jennifer
主权项 1. A method for processing database data in a distributed database system, wherein the distributed database system comprises a plurality of computing nodes communicatively coupled via computer networks, the method comprising: creating a plurality of different data replicas, wherein each of the data replicas is created in the following way: (a) sorting the database data according to at least two data attributes, wherein the database data is sorted differently based on the data attributes for each of the data replicas; (b) generating a row key based on the at least two data attributes; and (c) using the sorted database data with the row key as the data replica; storing different data replicas in different computing nodes; and creating an index for each of the data replicas according to its row key; wherein the at least two data attributes that are used as the basis for sorting of the database data comprise an attribute indicative of time and an attribute indicative of a device identifier; wherein the sorting of the database data of a first one of the data replicas comprises first sorting the database data by the attribute indicative of the device identifier and then sorting the database data by the attribute indicative of the time; and wherein the sorting of the database data of a second one of the data replicas comprises first sorting the database data by the attribute indicative of the time and then sorting the database data by the attribute indicative of the device identifier.
地址 Armonk NY US