发明名称 EFFICIENT AND FAULT-TOLERANT DISTRIBUTED ALGORITHM FOR LEARNING LATENT FACTOR MODELS THROUGH MATRIX FACTORIZATION
摘要 A method for estimating model parameters. The method comprises receiving a data set related to a plurality of users and associated content, partitioning the data set into a plurality of sub data sets in accordance with the users so that data associated with each user are not partitioned into more than one sub data set, storing each of the sub data sets in a separate one of a plurality of user data storages, each of said data storages being coupled with a separate one of a plurality of estimators, storing content associated with the plurality of users in a content storage, where the content storage is coupled to the plurality of estimators so that the content in the content storage is shared by the estimators, and estimating, asynchronously by each estimator, one or more parameters associated with a model based on data from one of the sub data sets.
申请公布号 US2014310281(A1) 申请公布日期 2014.10.16
申请号 US201314123259 申请日期 2013.03.15
申请人 Yahoo! 发明人 Somekh Oren Shlomo;Bornikov Edward;Golbandi Nadav;Rokhlenko Oleg;Lempel Ronny
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A method implemented on a machine having at least one processor, a storage, and a communication platform for estimating model parameters, comprising: receiving a data set related to a plurality of users and associated content; partitioning the data set into a plurality of sub data sets with respect to the users so that data associated with each user are not partitioned into more than one sub data set; storing each of the sub data sets in a separate one of a plurality of user data storages, each of the data storages being coupled with a separate one of a plurality of estimators; storing content associated with the plurality of users in a content storage, wherein the content storage is coupled to the plurality of estimators so that the content in the content storage is shared by the plurality of estimators; and estimating, asynchronously by each of the plurality of estimators, one or more parameters of a model associated with one of the sub data sets based on data from the corresponding sub data set.
地址 Sunnyvale CA US