发明名称 Parallel decision-or regression-tree growing
摘要 <p>The invention relates to a method of creating decision trees or regression trees for machine learning applications. The process of training the trees effectively uses a parallel computation including multiple computer processors in growing ensemble tree models. More specifically the invention is characterised by using processing units which have associated storage units comprising a data slice and a database management system operable to execute a method for growing multiple trees. The method comprising: creating subsets (or data bags) from a training dataset for training each of the trees to be grown, splitting the training set into disjoint data sub-sets and storing them in the data slices, creating root nodes for the trees, assigning data records of the bags to the root nodes of the trees to be grown and growing the trees iteratively wherein each iteration generates a node level of the ensemble of trees by passing through all of the data records in all slices by processing each slice in parallel.</p>
申请公布号 GB2516627(A) 申请公布日期 2015.02.04
申请号 GB20130013326 申请日期 2013.07.26
申请人 WARSAW UNIVERSITY OF TECHNOLOGY;INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 PAWEL CICHOSZ;MIECZYSLAW KLOPOTEK;KRYSZTOF SKOWRONSKI
分类号 G06N5/04;G06N3/08 主分类号 G06N5/04
代理机构 代理人
主权项
地址