Title of invention: METHOD FOR MAINTAINING A SAMPLE SYNOPSIS UNDER ARBITRARY INSERTIONS AND DELETIONS
Abstract: A method of incrementally maintaining a stable, bounded, uniform random sample S from a dataset R, in the presence of arbitrary insertions and deletions to the dataset R, and without accesses to the dataset R, comprises a random pairing method in which deletions are uncompensated until compensated by a subsequent insertion (randomly paired to the deletion) by including the insertion's item into S if and only if the uncompensated deletion's item was removed from S (i.e., was in S so that it could be removed). A method for resizing a sample to a new uniform sample of increased size while maintaining a bound on the sample size and balancing cost between dataset accesses and transactions to the dataset is also disclosed. A method for maintaining uniform, bounded samples for a dataset in the presence of growth in size of the dataset is additionally disclosed.
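The random pairing step described in the abstract can be illustrated with a minimal Python sketch. It is not taken from the patent text: the class name `RandomPairingSample`, the counters `c1` and `c2` (uncompensated deletions that were, or were not, in the sample), and the `bound` and `rng` parameters are all hypothetical names chosen for illustration. The sketch assumes that items are distinct, that only items currently in the dataset are deleted, and that an ordinary reservoir-sampling step is used when no deletion is awaiting compensation; the abstract does not specify that last case. Pairing an insertion with a uniformly chosen uncompensated deletion is modeled here by including the new item with probability c1 / (c1 + c2).

```python
import random


class RandomPairingSample:
    """Illustrative bounded sample maintained under inserts and deletes."""

    def __init__(self, bound, rng=None):
        self.bound = bound                  # maximum sample size
        self.rng = rng or random.Random()
        self.sample = []                    # current sample S
        self.dataset_size = 0               # |R|, tracked as a counter only
        self.c1 = 0  # uncompensated deletions whose item was in S
        self.c2 = 0  # uncompensated deletions whose item was not in S

    def insert(self, item):
        self.dataset_size += 1
        uncompensated = self.c1 + self.c2
        if uncompensated == 0:
            # Assumption: plain reservoir sampling when nothing is pending.
            if len(self.sample) < self.bound:
                self.sample.append(item)
            elif self.rng.random() < self.bound / self.dataset_size:
                victim = self.rng.randrange(len(self.sample))
                self.sample[victim] = item
        elif self.rng.random() < self.c1 / uncompensated:
            # Paired with a deletion that removed an item from S: include.
            self.sample.append(item)
            self.c1 -= 1
        else:
            # Paired with a deletion that did not touch S: exclude.
            self.c2 -= 1

    def delete(self, item):
        self.dataset_size -= 1
        if item in self.sample:
            self.sample.remove(item)
            self.c1 += 1
        else:
            self.c2 += 1


if __name__ == "__main__":
    s = RandomPairingSample(bound=10, rng=random.Random(0))
    for i in range(100):
        s.insert(i)
    for i in range(50):
        s.delete(i)
    for i in range(100, 150):
        s.insert(i)
    print(len(s.sample), s.c1 + s.c2)
```

Note that the sketch never reads the dataset itself; it only tracks its size, which matches the abstract's requirement of maintaining the sample without accesses to R. The list-based `remove` is linear in the sample size and is kept only for brevity.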
Publication number: US2008154541(A1)  Publication date: 2008.06.26
Application number: US20060615481  Filing date: 2006.12.22
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION  Inventors: GEMULLA RAINER;HAAS PETER J.;LEHNER WOLFGANG
Classification: G06F17/18;G06F17/30  Main classification: G06F17/18
Agency:  Agent:
Principal claim:
Address: