发明名称 Data classification and hierarchical clustering
摘要 Apparatus, systems, and methods can operate to provide efficient data clustering, data classification, and data compression. A method comprises training set of training instances can be processed to select a subset of size-1 patterns, initialize a weight of each size-1 pattern, include the size-1 patterns in classes in a model associated with the training set, and then include a set of top-k size-2 patterns in a way that provides an effective balance between local, class, and global significance patterns. A method comprises processing a dataset to compute an overall significance value of each size-2 pattern in each instance in the dataset, sort the size-2 patterns, and select the top-k size-2 patterns to be represented in clusters, which can be refined into a clustered hierarchy. A method comprises creating an uncompressed bitmap, reordering the bitmap, and compressing the bitmap. Additional apparatus, systems, and methods are disclosed.
申请公布号 US8407164(B2) 申请公布日期 2013.03.26
申请号 US20080602908 申请日期 2008.06.11
申请人 MALIK HASSAN HAIDER;KENDER JOHN RONALD;THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK 发明人 MALIK HASSAN HAIDER;KENDER JOHN RONALD
分类号 G06F15/18 主分类号 G06F15/18
代理机构 代理人
主权项
地址