Title of invention |
Method for partitioning a data set into frequency vectors for clustering |
Abstract |
A method of partitioning a data set in which certain elements of the data set are first identified as robust discriminator data elements. For the remaining non-discriminator data elements, an embodiment of the invention counts occurrences of a predetermined relationship between each non-discriminator data element and the identified robust discriminator data elements, and maps the counted occurrences onto vectors in a multi-dimensional frequency space. Finally, an embodiment forms the frequency vectors into clusters according to a distance or adjacency metric, where each cluster represents a different contextual class of meaningful attributes. The data set is thereby partitioned into an arbitrary number of clusters according to the discovered relationships between the non-discriminator data elements and the robust discriminator data elements, so that all of the non-discriminator data elements located in the same cluster possess similar attributes.
|
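The three steps in the abstract (identify discriminators, count relationships into frequency vectors, cluster by distance) can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the discriminators are supplied by hand, the "predetermined relationship" is taken to be immediate adjacency (discriminator directly before or after the word), the metric is Euclidean distance on normalized counts, and the clustering is a simple greedy threshold pass. The names `frequency_vectors`, `cluster`, and `threshold` are hypothetical.

```python
from math import sqrt


def frequency_vectors(tokens, discriminators):
    """Count, for each non-discriminator word, how often each discriminator
    occurs immediately before or after it (assumed relationship)."""
    dims = [(d, side) for d in discriminators for side in ("before", "after")]
    index = {dim: i for i, dim in enumerate(dims)}
    vectors = {}
    for i, word in enumerate(tokens):
        if word in discriminators:
            continue
        vec = vectors.setdefault(word, [0] * len(dims))
        # Discriminator directly preceding the word.
        if i > 0 and tokens[i - 1] in discriminators:
            vec[index[(tokens[i - 1], "before")]] += 1
        # Discriminator directly following the word.
        if i + 1 < len(tokens) and tokens[i + 1] in discriminators:
            vec[index[(tokens[i + 1], "after")]] += 1
    return vectors


def normalize(vec):
    """Scale counts to relative frequencies so frequent words stay comparable."""
    total = sum(vec)
    return [v / total for v in vec] if total else list(vec)


def euclidean(a, b):
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def cluster(vectors, threshold):
    """Greedy pass (an assumed stand-in for the clustering step): a word joins
    the first cluster whose seed vector lies within `threshold`, otherwise it
    starts a new cluster."""
    clusters = []  # list of (seed_vector, member_words)
    for word in sorted(vectors):
        vec = normalize(vectors[word])
        for seed, members in clusters:
            if euclidean(seed, vec) <= threshold:
                members.append(word)
                break
        else:
            clusters.append((vec, [word]))
    return [members for _, members in clusters]


tokens = "the cat sat on the mat and the dog sat on the rug".split()
discriminators = ["the", "on"]  # hand-picked for this toy example
vectors = frequency_vectors(tokens, discriminators)
groups = cluster(vectors, threshold=0.5)
print(groups)
```

On this toy input, words that follow "the" end up in one cluster, which mirrors the abstract's claim that elements in the same cluster share similar contextual attributes. A realistic system would select discriminators automatically and use a more robust clustering procedure.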
Publication number |
US2003033138(A1) |
Publication date |
2003.02.13 |
Application number |
US20020260294 |
Filing date |
2002.10.01 |
Applicant |
BANGALORE SRINIVAS;RICCARDI GIUSEPPE |
Inventor |
BANGALORE SRINIVAS;RICCARDI GIUSEPPE |
Classification |
G06K9/62;(IPC1-7):G10L19/14 |
Main classification |
G06K9/62 |
Agency |
|
Agent |
|
Principal claim |
|
Address |
|