摘要 |
<p>In a data mining system (12), clusters are used to categorize data within each model. An initial set of estimates of the parameters of each model and each cluster are provided. A portion of the data in the database (10) is read from a storage medium and brought into a rapid access memory buffer (22). Data contained in the data buffer (22) is used to update the original guesses at the parameters of the model in each cluster over all models. Some of the data belonging to a cluster is summarized or compressed and stored as a reduced form of the data representing sufficient statistics of the data. If further data is needed to categorize the cluster, more data is gathered from the database (10) and used in combination with compressed data until a stopping criteria (140) is met.</p> |