发明名称 Method, apparatus and programmed medium for clustering databases with categorical attributes
摘要 The present invention relates to a computer method, apparatus and programmed medium for clustering databases containing data with categorical attributes. The present invention assigns a pair of points to be neighbors if their similarity exceeds a certain threshold. The similarity value for pairs of points can be based on non-metric information. The present invention determines a total number of links between each cluster and every other cluster bases upon the neighbors of the clusters. A goodness measure between each cluster and every other cluster based upon the total number of links between each cluster and every other cluster and the total number of points within each cluster and every other cluster is then calculated. The present invention merges the two clusters with the best goodness measure. Thus, clustering is performed accurately and efficiently by merging data based on the amount of links between the data to be clustered.
申请公布号 US6049797(A) 申请公布日期 2000.04.11
申请号 US19980055940 申请日期 1998.04.07
申请人 LUCENT TECHNOLOGIES, INC. 发明人 GUHA, SUDIPTO;RASTOGI, RAJEEV;SHIM, KYUSEOK
分类号 G06F17/30;G06K9/62;(IPC1-7):G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址