发明名称 MODEL SELECTION FOR CLUSTER DATA ANALYSIS
摘要 A model selection method is provided for choosing the number of clusters, or more generally the parameters of a clustering algorithm. The algorithm is based on comparing the similarity between pairs of clustering runs on sub-samples or other perturbations of the data. High pairwise similarities show that the clustering represents a stable pattern in the data. The method is applicable to any clustering algorithm, and can also detect lack of structure. We show results on artificial and real data using a hierarchical clustering algorithm.
申请公布号 WO02095533(A3) 申请公布日期 2003.04.10
申请号 WO2002US15666 申请日期 2002.05.17
申请人 BIOWULF TECHNOLOGIES, LLC;BEN-HUR, ASA;ELISSEEFF, ANDRE;GUYON, ISABELLE 发明人 BEN-HUR, ASA;ELISSEEFF, ANDRE;GUYON, ISABELLE
分类号 G06F19/20;G06F19/24;G06G7/48;G06G7/58;G06K9/62 主分类号 G06F19/20
代理机构 代理人
主权项
地址