摘要 |
A parallel database system includes multiple processing units and associated storage modules. To achieve even distribution of workload, rows of a table are distributed as evenly as possible across the multiple processing units using a predefined partitioning algorithm. The predefined partitioning algorithm defines groups and probabilities assigned to boundary points defining the groups. Partitioning of data is based on the boundary points and the assigned probabilities.
|