Title: PARTITIONED JOINS IN ANALYTICAL DATABASES
Abstract: A first data table and a second data table to be joined are determined. The first data table and the second data table have a join attribute. Data blocks of the first data table are stored on a storage device. Attribute value information for the join attribute is determined for the data blocks of the first data table. At least one partition for the first data table and the second data table is defined using at least the attribute value information on the join attribute. Each partition of the at least one partition has a respective partition range of values of the join attribute. A pair wise partition join is processed on a first partition range of the determined partition ranges.
Publication No.: US2017116267(A1) Publication date: 2017.04.27
Application No.: US201514919092 Filing date: 2015.10.21
Applicant: International Business Machines Corporation Inventors: Boehme Thomas; Brodt Andreas; Koeth Oliver; Schiller Oliver
Classification: G06F17/30 Main classification: G06F17/30
Agency: Agent:
Principal claim: 1. A computer implemented method for performing a partitioned join, the method comprising: determining, by one or more computer processors, a first data table and a second data table to be joined, wherein the first data table and the second data table have a join attribute, and wherein data blocks of the first data table are stored on a storage device; determining, by one or more computer processors, an attribute value information of the join attribute for the data blocks of the first data table; defining, by one or more computer processors, at least one partition for the first data table and the second data table using at least the attribute value information on the join attribute, wherein each partition of the at least one partition has a respective partition range of values of the join attribute; and processing, by one or more computer processors, a pair wise partition join on a first partition range of the determined partition ranges comprising: determining, by one or more computer processors, a first data blocks of the first data table that fully fall under the first partition range using the attribute value information, reading the determined first data blocks from a disk into a memory and feeding the determined first data blocks to the partition join on the first partition range with the second data table; and determining, by one or more computer processors, a second data blocks of the first data table that partially fall under the first partition range using the attribute value information and selectively feeding records of the determined second data blocks that fall under the first partition range to the processing of the first partition range with the second data table.
Address: Armonk NY US
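Illustrative sketch (not part of the patent record): the block handling recited in claim 1 can be pictured roughly as follows. This is a minimal sketch under stated assumptions, not the applicant's implementation. It assumes that the "attribute value information" is a per-block minimum and maximum of the join attribute, that partition ranges are half-open intervals [lo, hi), that the pair wise join is a simple in-memory hash join, and that blocks are plain Python lists rather than data read from disk. The names `Block`, `make_block`, `classify_blocks`, and `partition_join` are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple

Row = Tuple[int, str]  # (join key, payload); hypothetical record layout


@dataclass
class Block:
    """A data block plus assumed per-block min/max of the join attribute."""
    rows: List[Row]
    min_key: int
    max_key: int


def make_block(rows: List[Row]) -> Block:
    keys = [key for key, _ in rows]
    return Block(rows=rows, min_key=min(keys), max_key=max(keys))


def classify_blocks(blocks: List[Block], lo: int, hi: int):
    """Split blocks into those fully inside [lo, hi) and those only overlapping it."""
    full, partial = [], []
    for block in blocks:
        if block.max_key < lo or block.min_key >= hi:
            continue  # no key of this block can fall in the range; skip it
        if lo <= block.min_key and block.max_key < hi:
            full.append(block)
        else:
            partial.append(block)
    return full, partial


def partition_join(blocks: List[Block], second: List[Row], lo: int, hi: int):
    """Join one partition range of the first table (blocks) with the second table."""
    full, partial = classify_blocks(blocks, lo, hi)

    # Build side: rows of the second table whose key lies in the partition range.
    build: Dict[int, List[str]] = {}
    for key, payload in second:
        if lo <= key < hi:
            build.setdefault(key, []).append(payload)

    result = []
    # Fully contained blocks are fed whole, with no per-record range check.
    for block in full:
        for key, payload in block.rows:
            for other in build.get(key, []):
                result.append((key, payload, other))
    # Partially overlapping blocks are fed selectively, record by record.
    for block in partial:
        for key, payload in block.rows:
            if lo <= key < hi:
                for other in build.get(key, []):
                    result.append((key, payload, other))
    return result


first_table = [
    make_block([(1, "a"), (3, "b")]),
    make_block([(4, "c"), (12, "d")]),
    make_block([(20, "e"), (25, "f")]),
]
second_table = [(3, "x"), (4, "y"), (12, "z"), (20, "q")]

print(partition_join(first_table, second_table, 0, 10))
print(partition_join(first_table, second_table, 10, 30))
```

In this example the block holding keys 4 and 12 straddles the boundary at 10, so it is treated as a partial block in both partition ranges and only its in-range records are fed to each join, while the other two blocks are each consumed whole by exactly one partition.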