Abstract |
Methods and apparatus, including computer systems and program products, for processing queries whose solution requires an information management system to perform logical operations on a data repository. In general, in one aspect, the techniques feature a method of executing queries on a data repository. The method includes receiving a query adapted for execution on a data set in the data repository; defining a sample of the data set, where the sample is a subset of the data set; executing the query on the sample; generating an estimate of a result of the query based on its execution on the sample; and providing the estimate to a user interface. The method may further include defining an Nth sample, such that the Nth sample is larger than an (N-1)th sample, and generating an Nth estimate of the result based on the execution of the query on the Nth sample.
|
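
The method summarized above can be illustrated with a minimal sketch. It is not the claimed implementation. It assumes that the query is a count of records matching a predicate, that each estimate is obtained by scaling the sample's match rate up to the size of the full data set, and that each sample is drawn as a prefix of one random ordering, so the Nth sample contains the (N-1)th. The names `progressive_estimates`, `predicate`, and `sample_sizes` are hypothetical and do not appear in the source.

```python
import random
from typing import Callable, Iterator, Sequence, Tuple


def progressive_estimates(
    data: Sequence[int],
    predicate: Callable[[int], bool],
    sample_sizes: Sequence[int],
    seed: int = 0,
) -> Iterator[Tuple[int, float]]:
    """Yield (sample_size, estimated_match_count) for successively larger samples.

    Assumption: the "query" is a count of records satisfying `predicate`,
    and the estimate scales the sample's match rate to the full data set.
    """
    total = len(data)
    if total == 0:
        raise ValueError("data set must not be empty")

    # Shuffle the record indices once. Taking ever-longer prefixes of this
    # ordering makes each sample a random subset that extends the previous one.
    order = list(range(total))
    random.Random(seed).shuffle(order)

    seen = 0      # number of records already included in the sample
    matches = 0   # matching records found in the sample so far
    for requested in sample_sizes:
        size = min(requested, total)  # a sample cannot exceed the data set
        if size <= seen:
            raise ValueError("each sample must be larger than the previous one")

        # Execute the query only on the records newly added to the sample.
        for index in order[seen:size]:
            if predicate(data[index]):
                matches += 1
        seen = size

        # Nth estimate: sample match rate scaled to the full data set.
        yield size, matches / size * total
```

A caller, such as a user interface layer, could iterate over `progressive_estimates(records, lambda r: r % 2 == 0, [10, 100, 1000])` and display each estimate as it arrives. Once a sample covers the whole data set, the estimate equals the exact count.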