发明名称 |
OPTIMIZATION OF A PLURALITY OF TABLE PROCESSING OPERATIONS IN A MASSIVE PARALLEL PROCESSING ENVIRONMENT |
摘要 |
A computer-implemented method for partitioning data for a query operation of one table of the database system is provided. The computer-implemented method comprises estimating a value distribution of the attribute in the result table based on a first value distribution of the attribute in the first column of the first table. The computer-implemented method further comprises determining boundaries for partitioning ranges of the attribute, based on the estimated value distribution, wherein the partitioning ranges correspond to a same number of rows of the result table. The computer-implemented method further comprises partitioning the first table with processing nodes of the query operation, based on the determined boundaries of partitioning ranges. |
申请公布号 |
US2016098447(A1) |
申请公布日期 |
2016.04.07 |
申请号 |
US201414505715 |
申请日期 |
2014.10.03 |
申请人 |
International Business Machines Corporation |
发明人 |
Gaza Lukasz;Gruszecki Artur M.;Kazalski Tomasz;Skibski Konrad K.;Stradomski Tomasz |
分类号 |
G06F17/30 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
1. A computer-implemented method for partitioning data for a query operation in a database system, the query operation involving a first table having an attribute in a first column and resulting in a result table, the computer-implemented method comprising:
estimating a value distribution of the attribute in the result table based on a first value distribution of the attribute in the first column of the first table;
determining boundaries for partitioning ranges of the attribute, based on the estimated value distribution, wherein the partitioning ranges correspond to a same number of rows of the result table; andpartitioning the first table with processing nodes of the query operation, based on the determined boundaries of partitioning ranges. |
地址 |
Armonk NY US |