Title of invention METHOD AND SYSTEM FOR IDENTIFYING CLUSTERS WITHIN A COLLECTION OF DATA ENTITIES
Abstract Embodiments of a method and system for identifying clusters in collections of data entities are generally described herein. In some embodiments, the method includes defining a metric space over the data entities. A distance function of the metric space may satisfy the triangle inequality. The method may include determining, based on the distance function of the metric space, a value for a number of clusters that minimizes a number of data bits used to define a model of the collection of the data entities. The model may thereby describe the collection of data entities using a minimum description length (MDL). The method may include assigning data entities of the collection of data entities to the clusters. The number of clusters to which the data entities are assigned may correspond to the determined value.
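As an illustration only, the idea in the abstract (choose the number of clusters that minimizes the bits needed to describe the data, using only a distance function) can be sketched in Python. This is not the claimed method: the record does not disclose the coding scheme, so the bit costs below, the farthest-first choice of cluster representatives, and the names `farthest_first_centers`, `description_length`, `mdl_cluster`, and `eps` are all assumptions made for this sketch.

```python
import math

def farthest_first_centers(points, dist, k):
    """Pick k representative indices by farthest-first traversal (assumed heuristic)."""
    centers = [0]  # start from the first entity; deterministic by construction
    while len(centers) < k:
        nxt = max(
            range(len(points)),
            key=lambda i: min(dist(points[i], points[c]) for c in centers),
        )
        centers.append(nxt)
    return centers

def assign(points, dist, centers):
    """Assign each entity to its nearest representative."""
    return [
        min(range(len(centers)), key=lambda j: dist(p, points[centers[j]]))
        for p in points
    ]

def description_length(points, dist, centers, labels, eps):
    """Assumed two-part code length in bits (hypothetical, not from the patent)."""
    n, k = len(points), len(centers)
    model_bits = k * math.log2(n)   # which entities serve as representatives
    label_bits = n * math.log2(k)   # which cluster each entity belongs to
    residual_bits = sum(            # crude cost of each entity's distance to its center
        math.log2(1 + dist(p, points[centers[l]]) / eps)
        for p, l in zip(points, labels)
    )
    return model_bits + label_bits + residual_bits

def mdl_cluster(points, dist, eps=0.1, max_k=None):
    """Return (best_k, labels) for the k with the smallest assumed description length."""
    n = len(points)
    max_k = n if max_k is None else min(max_k, n)
    best = None
    for k in range(1, max_k + 1):
        centers = farthest_first_centers(points, dist, k)
        labels = assign(points, dist, centers)
        bits = description_length(points, dist, centers, labels, eps)
        if best is None or bits < best[0]:
            best = (bits, k, labels)
    return best[1], best[2]

# Usage: absolute difference is a metric on the reals (it satisfies the triangle inequality).
data = [0.0, 0.1, 0.2, 10.0, 10.1, 10.2]
best_k, labels = mdl_cluster(data, lambda a, b: abs(a - b))
print(best_k, labels)  # → 2 [0, 0, 0, 1, 1, 1]
```

The sketch touches the data only through `dist`, so any metric over the entities could be substituted. The parameter `eps` sets the assumed precision of the residual code and influences which cluster count wins.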
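Application publication number US2014149410(A1) Application publication date 2014.05.29
Application number US201213686995 Filing date 2012.11.28
Applicant KENEFIC RICHARD J.;WATTS JOHN G.;RAYTHEON COMPANY Inventor KENEFIC RICHARD J.;WATTS JOHN G.
Classification number G06F17/30 Main classification number G06F17/30
Agency Agent
Principal claim
Address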