发明名称 Method and device for clustering categorical data and identifying anomalies, outliers, and exemplars
摘要 One aspect of the invention is a method for assigning categorical data to a plurality of clusters. The method may include identifying a plurality of categories associated with the data. The method also may include, for each category in the plurality of categories, identifying at least one element associated with the category. The method also may include specifying a number of clusters to which the data may be assigned. The method additionally may include assigning at least some of the data, wherein each assigned datum is assigned to a respective one of the clusters. The method further may include, for at least one of the clusters, determining, for at least one category, the frequency in data assigned to the cluster of at least one element associated with the category. Further, the invention may provide for detecting outliers, anomalies, and exemplars in the categorical data.
申请公布号 US8090721(B2) 申请公布日期 2012.01.03
申请号 US20100714489 申请日期 2010.02.27
申请人 FOGEL DAVID B.;NATURAL SELECTION, INC. 发明人 FOGEL DAVID B.
分类号 G06F17/30;G06F7/00 主分类号 G06F17/30
代理机构 代理人
主权项
地址