发明名称 |
Method for building space-splitting decision tree |
摘要 |
A method is provided for data classification that achieves improved interpretability and accuracy while preserving the efficiency and scalability of univariate decision trees. To build a compact decision tree, the method searches for clusters in subspaces to enable multivariate splitting based on weighted distances to such a cluster. To classify an instance more accurately, the method performs a nearest neighbor (NN) search among the potential nearest leaf nodes of the instance. The similarity measure used in the NN search is based on Euclidean distances defined in different subspaces for different leaf nodes. Since instances are scored by their similarity to a certain class, this approach provides an effective means for target selection that is not supported well by conventional decision trees.
|
申请公布号 |
US6871201(B2) |
申请公布日期 |
2005.03.22 |
申请号 |
US20010918952 |
申请日期 |
2001.07.31 |
申请人 |
INTERNATIONAL BUSINESS MACHINES CORPORATION |
发明人 |
YU PHILIP SHI-LUNG;WANG HAIXUN |
分类号 |
G06F7/00;G06F17/30;G06K9/62;(IPC1-7):G06F17/30 |
主分类号 |
G06F7/00 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|