发明名称 A SCALABLE SYSTEM FOR CLUSTERING OF LARGE DATABASES HAVING MIXED DATA ATTRIBUTES
摘要 <p>A scalable clustering algorithm (12) accesses database (10) of records having attributes or data fields of both enumerated discrete and ordered values and brings a portion of the data records into a rapid access memory. A cluster model for the data includes a table of probabilities (160) for the enumerated, discrete data fields of the data records. The cluster model for data fields that are ordered comprises a mean and spread of the cluster. The cluster model is updated from the database records brought into the rapid access memory. Some of the database records in the rapid access memory are summerized and stored within the rapid access memory. A criteria is evaluated to dermine if further data should be accessed from the database to further cluster data records in the database. Additional database records in the database are accessed and brought into the rapid access memory for further updating of the cluster model.</p>
申请公布号 WO1999062007(A1) 申请公布日期 1999.12.02
申请号 US1999006717 申请日期 1999.03.29
申请人 发明人
分类号 主分类号
代理机构 代理人
主权项
地址