发明名称 OPTIMIZED TRAINING OF LINEAR MACHINE LEARNING MODELS
摘要 An indication of a data source to be used to train a linear prediction model is obtained. The model is to generate predictions using respective parameters assigned to a plurality of features derived from observation records of the data source. The parameter values are stored in a parameter vector. During a particular learning iteration of the training phase of the model, one or more features for which parameters are to be added to the parameter vector are identified. In response to a triggering condition, parameters for one or more features are removed from the parameter vector based on an analysis of relative contributions of the features represented in the parameter vector to the model's predictions. After the parameters are removed, at least one parameter is added to the parameter vector.
申请公布号 US2016078361(A1) 申请公布日期 2016.03.17
申请号 US201414484201 申请日期 2014.09.11
申请人 Amazon Technologies, Inc. 发明人 BRUECKNER MICHAEL;BLICK DANIEL
分类号 G06N99/00;H04L29/08 主分类号 G06N99/00
代理机构 代理人
主权项 1. A system, comprising: one or more computing devices configured to: receive, at a machine learning service of a provider network, an indication of a data source to be used for generating a linear prediction model, wherein, to generate a prediction, the linear prediction model is to utilize respective weights assigned to individual ones of a plurality of features derived from observation records of the data source, wherein the respective weights are stored in a parameter vector of the linear prediction model;determine, based at least in part on examination of a particular set of observation records of the data source, respective weights for one or more features to be added to the parameter vector during a particular learning iteration of a plurality of learning iterations of a training phase of the linear prediction model;in response to a determination that a triggering condition has been met during the training phase, identify one or more pruning victims from a set of features whose weights are included in the parameter vector, based at least in part on a quantile analysis of the weights, wherein the quantile analysis is performed without a sort operation; andremove at least a particular weight corresponding to a particular pruning victim of the one or more pruning victims from the parameter vector; andgenerate, during a post-training-phase prediction run of the linear prediction model, a prediction using at least one feature for which a weight is determined after the particular weight of the particular pruning victim is removed from the parameter vector.
地址 Reno NV US
您可能感兴趣的专利