发明名称 OPTIMIZATION OF A PLURALITY OF TABLE PROCESSING OPERATIONS IN A MASSIVE PARALLEL PROCESSING ENVIRONMENT
摘要 A computer-implemented method for partitioning data for a query operation of one table of the database system is provided. The computer-implemented method comprises estimating a value distribution of the attribute in the result table based on a first value distribution of the attribute in the first column of the first table. The computer-implemented method further comprises determining boundaries for partitioning ranges of the attribute, based on the estimated value distribution, wherein the partitioning ranges correspond to a same number of rows of the result table. The computer-implemented method further comprises partitioning the first table with processing nodes of the query operation, based on the determined boundaries of partitioning ranges.
申请公布号 US2016098447(A1) 申请公布日期 2016.04.07
申请号 US201414505715 申请日期 2014.10.03
申请人 International Business Machines Corporation 发明人 Gaza Lukasz;Gruszecki Artur M.;Kazalski Tomasz;Skibski Konrad K.;Stradomski Tomasz
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A computer-implemented method for partitioning data for a query operation in a database system, the query operation involving a first table having an attribute in a first column and resulting in a result table, the computer-implemented method comprising: estimating a value distribution of the attribute in the result table based on a first value distribution of the attribute in the first column of the first table; determining boundaries for partitioning ranges of the attribute, based on the estimated value distribution, wherein the partitioning ranges correspond to a same number of rows of the result table; andpartitioning the first table with processing nodes of the query operation, based on the determined boundaries of partitioning ranges.
地址 Armonk NY US