摘要 |
PROBLEM TO BE SOLVED: To improve clustering quality by removing elements damaging in-cluster consistency.SOLUTION: As a set of data, a set {C} of first clusters Cconstituted by a first clustering method and a set {Q} of second clusters Qconstituted by a second clustering method different from the first clustering method are obtained, and with respect to each of the first clusters C, a cluster Qthe number of whose common elements with the cluster Cis the largest is selected from the set {Q} of the second clusters, and a product set Iof the first cluster Cand the second cluster Qcorresponding to this is obtained as a third cluster, and a set {I} of the obtained third clusters is output. |