Title of invention: METHOD, APPARATUS, AND COMPUTER-READABLE MEDIUM FOR EFFICIENTLY PERFORMING OPERATIONS ON DISTINCT DATA VALUES
Abstract: An apparatus, computer-readable medium, and computer-implemented method for efficiently performing operations on distinct data values, including storing a tokenized column of data in a table by mapping each unique data value in a corresponding domain to a unique entity ID, and replacing each of the data values in the column with the corresponding entity ID to generate a column of tokenized data containing one or more entity IDs, receiving a query directed to the column of data, the query defining one or more group sets for grouping the data retrieved in response to the query, and generating an entity map vector for each group set, the length of each entity map vector equal to the number of unique entity IDs for the domain, and the value of each bit in the entity map vector indicating the presence or absence of a different unique entity ID in the group set.
Publication number: US2014279853(A1) Publication date: 2014.09.18
Application number: US201313835590 Filing date: 2013.03.15
Applicant: Grondin Richard;Fadeitchev Evgueni Inventor: Grondin Richard;Fadeitchev Evgueni
Classification: G06F17/30 Main classification: G06F17/30
Agency: Agent:
Principal claim: 1. A method for efficiently performing operations on distinct data values by one or more computing devices, the method comprising: storing, by at least one of the one or more computing devices, a tokenized column of data in a table, the tokenized column of data created by mapping each unique data value in a domain which corresponds to a column of data to an entity ID, and replacing each of the data values in the column with the corresponding entity ID to generate the column of tokenized data containing one or more entity IDs; receiving, by at least one of the one or more computing devices, a query directed to the column of data, the query defining one or more group sets for grouping the data retrieved in response to the query; and generating, by at least one of the one or more computing devices, an entity map vector for each group set in the one or more group sets, the length of each entity map vector equal to the total number of entity IDs for the domain, and the value of each bit in each entity map vector indicating the presence or absence of a different entity ID in the corresponding group set.
Address: Ste-Julie CA
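Illustrative sketch (not part of the patent record): the steps recited in claim 1 can be pictured with a small Python example. It is a minimal sketch under stated assumptions, not the applicants' implementation. The function names `tokenize` and `entity_map_vectors`, the sample data, and the representation of a group set as a grouping key per row are all hypothetical. Counting the set bits to obtain a per-group distinct count is shown only as one plausible use of such a vector; the claim itself does not recite it.

```python
from collections import defaultdict


def tokenize(column):
    """Map each unique value in the column's domain to an entity ID
    and replace every value in the column with its entity ID."""
    entity_ids = {}   # hypothetical domain dictionary: value -> entity ID
    tokenized = []
    for value in column:
        if value not in entity_ids:
            entity_ids[value] = len(entity_ids)  # next unused ID
        tokenized.append(entity_ids[value])
    return entity_ids, tokenized


def entity_map_vectors(tokenized, group_keys, domain_size):
    """Build one bit vector per group set. Each vector has one bit per
    entity ID in the domain; bit i is 1 if entity ID i occurs in the group."""
    vectors = defaultdict(lambda: [0] * domain_size)
    for entity_id, key in zip(tokenized, group_keys):
        vectors[key][entity_id] = 1
    return dict(vectors)


# Hypothetical sample data: a customer column grouped by a region column.
customers = ["ann", "bob", "ann", "cat", "bob", "ann"]
regions = ["east", "east", "west", "west", "east", "west"]

entity_ids, tokenized = tokenize(customers)
vectors = entity_map_vectors(tokenized, regions, len(entity_ids))

# Assumed use: the number of set bits is the distinct count for the group.
distinct_counts = {key: sum(bits) for key, bits in vectors.items()}
```

In this sketch each vector's length equals the number of entity IDs in the domain, so a group's distinct values can be read from the bits without rescanning the original column values.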