Title of invention |
Distributed decision tree training |
Abstract |
A computerized decision tree training system may include a distributed control processing unit configured to receive input of training data for training a decision tree. The system may further include a plurality of data batch processing units, each data batch processing unit being configured to evaluate each of a plurality of split functions of a decision tree for a respective data batch of the training data, to thereby compute a partial histogram for each split function, for each datum in the data batch. The system may further include a plurality of node batch processing units configured to aggregate the associated partial histograms for each split function to form an aggregated histogram for each split function for each of a subset of frontier tree nodes, and to determine a selected split function for each frontier tree node by computing the split function that produces the highest information gain for the frontier tree node.
|
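The data flow described in the abstract can be illustrated with a minimal single-process sketch. It is not the patented implementation. It assumes binary split functions, class-label histograms, and Shannon entropy as the basis for information gain, none of which the abstract specifies. All names (`partial_histograms`, `aggregate`, `select_splits`, the toy data) are hypothetical.

```python
from collections import Counter, defaultdict
from math import log2

def partial_histograms(batch, split_functions):
    """Data batch unit (assumed form): count labels per (node, split, branch)."""
    hists = defaultdict(Counter)
    for node, features, label in batch:
        for s, split in enumerate(split_functions):
            hists[(node, s, bool(split(features)))][label] += 1
    return hists

def aggregate(partials):
    """Node batch unit, step 1: sum the partial histograms from all batches."""
    total = defaultdict(Counter)
    for hists in partials:
        for key, counts in hists.items():
            total[key].update(counts)
    return total

def entropy(counts):
    """Shannon entropy of a label histogram (assumed gain criterion)."""
    n = sum(counts.values())
    return -sum(c / n * log2(c / n) for c in counts.values() if c) if n else 0.0

def select_splits(total, num_splits):
    """Node batch unit, step 2: pick the highest-gain split per frontier node."""
    selected = {}
    for node in {key[0] for key in total}:
        gains = []
        for s in range(num_splits):
            left = total.get((node, s, False), Counter())
            right = total.get((node, s, True), Counter())
            parent = left + right
            n = sum(parent.values())
            children = sum(sum(h.values()) / n * entropy(h) for h in (left, right))
            gains.append(entropy(parent) - children)
        selected[node] = max(range(num_splits), key=gains.__getitem__)
    return selected

# Toy usage: two candidate splits, two data batches, one frontier node (id 0).
splits = [lambda x: x[0] > 0.5, lambda x: x[1] > 0.5]
batches = [
    [(0, (0.1, 0.9), "a"), (0, (0.2, 0.1), "a")],
    [(0, (0.8, 0.9), "b"), (0, (0.9, 0.2), "b")],
]
total = aggregate(partial_histograms(b, splits) for b in batches)
selected = select_splits(total, len(splits))
```

In this sketch each batch only ever produces label counts, so the aggregation step is a plain sum and the batches could be processed on separate machines before being merged.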
Publication number |
US8543517(B2) |
Publication date |
2013.09.24 |
Application number |
US20100797430 |
Filing date |
2010.06.09 |
Applicant |
SHOTTON JAMIE;BUDIU MIHAI-DAN;FITZGIBBON ANDREW WILLIAM;FINOCCHIO MARK;MOORE RICHARD E.;ROBERTSON DUNCAN;MICROSOFT CORPORATION |
Inventor |
SHOTTON JAMIE;BUDIU MIHAI-DAN;FITZGIBBON ANDREW WILLIAM;FINOCCHIO MARK;MOORE RICHARD E.;ROBERTSON DUNCAN |
Classification |
G06F15/18 |
Main classification |
G06F15/18 |
Agency |
|
Agent |
|
Principal claim |
|
Address |
|