发明名称 Methods and systems for biclustering algorithm
摘要 Methods and systems for improved unsupervised learning are described. The unsupervised learning can consist of biclustering a data set, e.g., by biclustering subsets of the entire data set. In an example, the biclustering does not include feeding know and proven results into the biclustering methodology or system. A hierarchical approach can be used that feeds proven clusters back into the biclustering methodology or system as the input. Data that does not cluster may be discarded. Thus, a very large unknown data set can be acted on to learn about the data. The system is also amenable to parallelization.
申请公布号 US9043326(B2) 申请公布日期 2015.05.26
申请号 US201213385042 申请日期 2012.01.30
申请人 The Curators of the University of Missouri 发明人 Wunsch, II Donald Coolidge;Xu Rui;Kim Sejun
分类号 G06F17/30;G06F19/24 主分类号 G06F17/30
代理机构 Billion & Armitage 代理人 Billion & Armitage ;Collins Michael A.;Armitage Benjamin C.
主权项 1. A method for clustering information from a data set, comprising: creating first clusters of related data from a first subspace of unclustered data in the data set, wherein related data added to each one of the first clusters shares at least one attribute in the first subspace; and creating second clusters of related data from a second subspace of unclustered data in the data set, wherein adding the related data from the second subspace to one of the second clusters requires satisfying a vigilance parameter to ensure each one of the second clusters of related data shares at least one attribute in the second subspace; and providing feedback to increase the vigilance parameter if the related data from the second subspace added to the one of the second clusters does not share at least one attribute associated with one of the first clusters, wherein increasing the vigilance parameter forces the related data from the second subspace to be added to a new or different one of the second clusters.
地址 Columbia MO US