发明名称 Multi-dimensional database record compression utilizing optimized cluster models
摘要 Apparatus and method for use in querying a database containing data records. The database is characterized by a compression scheme to provide data clustering information. In accordance with a exemplary embodiment of the invention a functional representation of data clustering is a Gaussian and the queries are performing by integrating the Gaussian corresponding to each of the data clusters over the ranges to determine the sum or the count of data records from the database that fall within the selected ranges. The process chooses a value for the cluster number K. The cluster model is next broken up into areas (tiles) based on user defined parameters. Data from the database is then classified based on the tiling information. A sorted version of the classified data, ordered by cluster number and then by the tile number within the cluster is generated. This data is then evaluated to test the sufficiency of the model created during the clustering.</PTEXT>
申请公布号 US6633882(B1) 申请公布日期 2003.10.14
申请号 US20000606964 申请日期 2000.06.29
申请人 MICROSOFT CORPORATION 发明人 FAYYAD USAMA;SHANMUGASUNDARAM JAYAVEL
分类号 G06F17/30;(IPC1-7):G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址