发明名称 Distributed decision tree training
摘要 A computerized decision tree training system may include a distributed control processing unit configured to receive input of training data for training a decision tree. The system may further include a plurality of data batch processing units, each data batch processing unit being configured to evaluate each of a plurality of split functions of a decision tree for respective data batch of the training data, to thereby compute a partial histogram for each split function, for each datum in the data batch. The system may further include a plurality of node batch processing units configured to aggregate the associated partial histograms for each split function to form an aggregated histogram for each split function for each of a subset of frontier tree nodes and to determine a selected split function for each frontier tree node by computing the split function that produces highest information gain for the frontier tree node.
申请公布号 US8543517(B2) 申请公布日期 2013.09.24
申请号 US20100797430 申请日期 2010.06.09
申请人 SHOTTON JAMIE;BUDIU MIHAI-DAN;FITZGIBBON ANDREW WILLIAM;FINOCCHIO MARK;MOORE RICHARD E.;ROBERTSON DUNCAN;MICROSOFT CORPORATION 发明人 SHOTTON JAMIE;BUDIU MIHAI-DAN;FITZGIBBON ANDREW WILLIAM;FINOCCHIO MARK;MOORE RICHARD E.;ROBERTSON DUNCAN
分类号 G06F15/18 主分类号 G06F15/18
代理机构 代理人
主权项
地址