摘要 |
An apparatus, system, and method are disclosed for efficient adaptive parallel data clustering for loading data into a table by generating a hint-key for each of one or more records in the input data stream, ordering the one or more records in a first-level clustering agent to generate one or more record lists ordered by hint-key. The apparatus, system, and method simultaneously processes one or more record lists in a second-level clustering agent, outputs the one or more records of the one or more record lists clustered by the hint-key of the one or more records, stores, in a partial block cache, a plurality of partial blocks that are output by the second-level clustering agent, and stores, in a partial page cache, a plurality of last partial pages of the partial blocks that have been victimized from the partial block cache.
|