Title of invention: Random sampling of rows in a parallel processing database system
Abstract: A method, apparatus, and article of manufacture for random sampling of rows stored in a table, wherein the table has a plurality of partitions. A row count is determined for each of the partitions of the table and a total number of rows in the table is determined from the row count for each of the partitions of the table. A proportional allocation of a sample size is computed for each of the partitions based on the row count and the total number of rows. A sample set of rows of the sample size is retrieved from the table, wherein each of the partitions of the table contributes its proportional allocation of rows to the sample set of rows. Preferably, the computer system is a parallel processing database system, wherein each of its processing units manages a partition of the table, and some of the above steps can be performed in parallel by the processing units.
Publication number: US6564221(B1) Publication date: 2003.05.13
Application number: US19990457274 Filing date: 1999.12.08
Applicant: NCR CORPORATION Inventor: SHATDAL AMBUJ
Classification: G06F17/30;(IPC1-7):G06F7/00;G06F17/00 Main classification: G06F17/30
Agency: Agent:
Principal claim:
Address:
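
The proportional allocation described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The function names `proportional_allocation` and `sample_rows` are hypothetical, in-memory lists stand in for table partitions, and the partitions are visited in a sequential loop rather than in parallel by separate processing units. The abstract does not say how fractional allocations are rounded, so the largest-remainder rule used here is an assumption, chosen so that the per-partition allocations add up exactly to the requested sample size.

```python
import random
from typing import List, Optional, Sequence


def proportional_allocation(row_counts: Sequence[int], sample_size: int) -> List[int]:
    """Split sample_size across partitions in proportion to their row counts.

    Rounding uses the largest-remainder rule (an assumption, not taken
    from the patent record) so the allocations sum to sample_size.
    """
    total = sum(row_counts)  # total number of rows in the table
    if sample_size < 0 or sample_size > total:
        raise ValueError("sample_size must be between 0 and the total row count")
    if total == 0:
        return [0] * len(row_counts)

    # Integer part of each partition's proportional share.
    shares = [sample_size * n // total for n in row_counts]
    # Fractional part, kept as an integer numerator over `total`.
    remainders = [sample_size * n % total for n in row_counts]

    # Hand out the rows lost to flooring, largest remainder first
    # (ties broken by partition index).
    leftover = sample_size - sum(shares)
    order = sorted(range(len(row_counts)), key=lambda i: (-remainders[i], i))
    for i in order[:leftover]:
        shares[i] += 1
    return shares


def sample_rows(partitions: Sequence[list], sample_size: int,
                seed: Optional[int] = None) -> list:
    """Draw a sample in which each partition contributes its allocation."""
    rng = random.Random(seed)
    counts = [len(p) for p in partitions]  # per-partition row counts
    allocation = proportional_allocation(counts, sample_size)

    sample = []
    # In a parallel system each processing unit could run this step on
    # its own partition; here it is a plain sequential loop.
    for rows, k in zip(partitions, allocation):
        sample.extend(rng.sample(rows, k))  # sampling without replacement
    return sample
```

For example, with partitions of 50, 30, and 20 rows and a sample size of 10, `proportional_allocation([50, 30, 20], 10)` returns `[5, 3, 2]`, and `sample_rows` then draws that many rows from each partition without replacement.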