发明名称 Clustering mixed attribute patterns
摘要 A technique for clustering data points in a data set that is arranged as a matrix having n objects and m attributes. Each categorical attribute of the data set is converted to a 1-of-p representation of the categorical attribute. A converted data set A is formed based on the data set and the 1-of-p representation for each categorical attribute. The converted data set A is compressed using, for example, a Goal Directed Projection compression technique or a Singular Value Decomposition compression technique, to obtain q basis vectors, with q being defined to be at least m+1. The transformed data set is projected onto the q basis vectors to form a data matrix having at least one vector, with each vector having q dimensions. Lastly, a clustering technique is performed on the data matrix having vectors having q dimensions.
申请公布号 US6260038(B1) 申请公布日期 2001.07.10
申请号 US19990394883 申请日期 1999.09.13
申请人 INTERNATIONAL BUSINEMSS MACHINES CORPORATION 发明人 MARTIN DAVID C.;MODHA DHARMENDRA SHANTILAL;VAITHYANATHAN SHIVAKUMAR
分类号 G06F17/30;G06K9/62;(IPC1-7):G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址