A method for machine learning a data set in a data processing framework is disclosed. A forest is trained with the data set that generates a plurality of trees in parallel. Each tree includes leaf nodes having a constant weight. A discriminative value for each leaf node is learned with a supervised model. The forest is reconstructed with the discriminative values replacing the constant weight for each leaf node.