发明名称 |
Method, system and computer program product for organizing data |
摘要 |
Methods, systems and computer program products for organizing semi-structured data into taxonomies are described. The semi-structured data is clustered into a plurality of clusters based on a plurality of attributes and the clusters are ranked relative to each other. The attributes are also ranked relative to each other based on a common ranking measure suitable for each of the attributes. The taxonomy may be represented as a hierarchical tree structure comprising a root node and a plurality of child nodes with the root node containing the semi-structured data and each of the child nodes containing data points of a cluster generated from the semi-structured data.
|
申请公布号 |
US2007143235(A1) |
申请公布日期 |
2007.06.21 |
申请号 |
US20050314596 |
申请日期 |
2005.12.21 |
申请人 |
INTERNATIONAL BUSINESS MACHINES CORPORATION |
发明人 |
KUMMAMURU KRISHNA;KANKAR PANKAJ |
分类号 |
G06N3/02 |
主分类号 |
G06N3/02 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|