发明名称 Generating decision trees with discriminants and employing the same in data classification
摘要 At least a portion of a decision tree structure is generated from one or more multidimensional data objects by representing data associated with one or more of the data objects as a node, determining a condition for dividing the data at the node into at least two subsequent nodes based on a discriminant measure which maximises the separation between classes associated with the data, and dividing the data according to the condition. The multidimensional objects may be data records including feature variables and class variables and the method comprises splitting a decision tree, recursively, such that the greatest amount of separation among the class values of the data is achieved. The discriminant measure is preferably determined in accordance with Fisher's discriminant technique and the data is divided at a split plane determined to be perpendicular to a direction determined according to said technique and where an entropy measure is substantially optimised, as determined in accordance with a gini index.
申请公布号 GB2369697(A) 申请公布日期 2002.06.05
申请号 GB20010009736 申请日期 2001.04.20
申请人 * INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 CHARU C * AGGARWAL;PHILIP SHI-LUNG * YU;PHILIP SHI-LUNG * YU;CHARU C * AGGARWAL
分类号 G06F9/44;G06F17/30;G06K9/62;G06N5/04;(IPC1-7):G06F17/30 主分类号 G06F9/44
代理机构 代理人
主权项
地址