摘要 |
PROBLEM TO BE SOLVED: To provide a technique for comparing decision trees in detail without depending on a difference of tree structures thereof. SOLUTION: A data set storage section stores a plurality of data sets, which are sets of a plurality of instances respectively having the same kind of target attribute. A decision tree information storage section stores a plurality of decision trees respectively generated from different data sets. A target attribute determination section determines a value of a target attribute having many instances to be classified in the process of generating a decision tree for a node as a label of the node, for each node of the decision tree. A basic frequency calculation section calculates a frequency at which an instance having the same target attribute as a label of a node is classified in the process of generating a decision tree, for each node. An application frequency calculation section makes a decision tree classify an instance which has caused another decision tree to be generated, and calculates a frequency at which the instance having the same target attribute as a label of the node is classified, for each node of the decision tree. An output section outputs a result of comparing two frequencies as a comparison result of the decision trees. COPYRIGHT: (C)2010,JPO&INPIT
|