发明名称 Decision tree induction that is sensitive to attribute computational complexity
摘要 A decision tree for classifying computer files is constructed. Computational complexities of a set of candidate attributes are determined. A set of attribute vectors are created for a set of training files with known classification. A node is created to represent the set. A weighted impurity reduction score is calculated for each candidate attribute based on the computational complexity of the attribute. If a stopping criterion is satisfied then the node is set as a leaf node. Otherwise the node is set as a branch node and the attribute with the highest weighted impurity reduction score is selected as the splitting attribute for the branch node. The set of attribute vectors are split into subsets based on their attribute values of the splitting attribute. The above process is repeated for each subset. The tree is then pruned based on the computational complexities of the splitting attributes.
申请公布号 US8495096(B1) 申请公布日期 2013.07.23
申请号 US201213450390 申请日期 2012.04.18
申请人 PEREIRA SHANE;RAMZAN ZULFIKAR;SATISH SOURABH;SYMANTEC CORPORATION 发明人 PEREIRA SHANE;RAMZAN ZULFIKAR;SATISH SOURABH
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址