发明名称 Incremental maintenance of an approximate histogram in a database system
摘要 Techniques for maintaining an approximate histogram of a relation in a database, in the presence of updates to the relation. The histogram includes a number of subsets, or "buckets," each representing at least one possible value of an attribute of the relation. Each of the subsets has a count associated therewith indicative of the frequency of occurrence of the corresponding value of the attribute. After an update to the relation, the counts associated with the subsets are compared to a threshold. If the count associated with a given subset exceeds the threshold, the given subset is separated at its median into two separate subsets. After the separation operation, the two subsets with the lowest counts are combined such that a constant number of subsets are maintained in the histogram, if the total combined count of the subsets does not exceed the threshold. If no two subsets have a total combined count which does not exceed the threshold, the histogram is recomputed from a random sample of the relation. The invention substantially reduces the number of times the histogram must be recomputed from the random sample, and is particularly well-suited for use with approximate equi-depth and compressed histograms.
申请公布号 US5870752(A) 申请公布日期 1999.02.09
申请号 US19970915804 申请日期 1997.08.21
申请人 LUCENT TECHNOLOGIES INC.;NCR CORPORATION 发明人 GIBBONS, PHILLIP B.;MATIAS, YOSSI;POOSALA, VISWANATH;WITKOWSKI, ANDREW
分类号 G06F17/30;(IPC1-7):G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址