发明名称 Method and system for squashing a large data set
摘要 Apparatus and method for summarizing an original large data set with a representative data set. The data elements in both the original data set and the representative data set have the same variables, but there are significantly fewer data elements in the representative data set. Each data element in the representative data set has an associated weight, representing the degree of compression. There are three steps for constructing the representative data set. First, the original data elements are partitioned into separate bins. Second, moments of the data elements partitioned in each bin are calculated. Finally, the representative data set is generated by finding data elements and associated weights having substantially the same moments as the original data set.
申请公布号 US6539391(B1) 申请公布日期 2003.03.25
申请号 US19990373823 申请日期 1999.08.13
申请人 AT&T CORP. 发明人 DUMOUCHEL WILLIAM H.;VOLINSKY CHRISTOPHER T.;JOHNSON THEODORE J.;CORTES CORINNA;PREGIBON DARYL
分类号 G06F17/30;(IPC1-7):G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址