发明名称 Model selection for cluster data analysis
摘要 A model selection method is provided for choosing the number of clusters, or more generally the parameters of a clustering algorithm. The algorithm is based on comparing the similarity between pairs of clustering runs on sub-samples or other perturbations of the data. High pairwise similarities show that the clustering represents a stable pattern in the data. The method is applicable to any clustering algorithm, and can also detect lack of structure. We show results on artificial and real data using a hierarchical clustering algorithm.
申请公布号 AU2002259250(A1) 申请公布日期 2002.12.03
申请号 AU20020259250 申请日期 2002.05.17
申请人 BIOWULF TECHNOLOGIES, LLC 发明人 ANDRE ELISSEEFF;ASA BEN-HUR;ISABELLE GUYON
分类号 G06F19/20;G06F19/24;G06G7/48;G06G7/58;G06K9/62 主分类号 G06F19/20
代理机构 代理人
主权项
地址