发明名称 A SCALABLE SYSTEM FOR CLUSTERING OF LARGE DATABASES
摘要 <p>In a data mining system (12), clusters are used to categorize data within each model. An initial set of estimates of the parameters of each model and each cluster are provided. A portion of the data in the database (10) is read from a storage medium and brought into a rapid access memory buffer (22). Data contained in the data buffer (22) is used to update the original guesses at the parameters of the model in each cluster over all models. Some of the data belonging to a cluster is summarized or compressed and stored as a reduced form of the data representing sufficient statistics of the data. If further data is needed to categorize the cluster, more data is gathered from the database (10) and used in combination with compressed data until a stopping criteria (140) is met.</p>
申请公布号 WO1999048018(A1) 申请公布日期 1999.09.23
申请号 US1999005759 申请日期 1999.03.16
申请人 发明人
分类号 主分类号
代理机构 代理人
主权项
地址