发明名称 |
SYSTEMS AND METHODS FOR WEIGHTED SAMPLING |
摘要 |
Techniques provided herein allow for a plurality of weighted samplings without replacement to be performed in O(n) time. According to some embodiments, the weighted sampling without replacement may be performed on a data set X of data elements {x1, x2, x3, . . . , xn}, where each data element xi of the data set X has an associated weight wi. The data set X may comprise data generated by a social networking system. Each data element xi in the data set X may be assigned with a random score si based on the weight wi associated with the data element xi. The random score si can be used to perform weighted sampling without replacement. |
申请公布号 |
US2015032761(A1) |
申请公布日期 |
2015.01.29 |
申请号 |
US201313951441 |
申请日期 |
2013.07.25 |
申请人 |
Facebook, Inc. |
发明人 |
Pasternack Jeff |
分类号 |
G06F17/30 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
1. A computer system comprising:
at least one processor; and a memory storing instructions configured to instruct the at least one processor to perform: identifying a data set X of n data elements from which to generate a sample set of m data elements, wherein n≧m, each data element xi of the data set X has a weight wi associated with the data element xi, and the data set X comprises data generated by a social networking system; assigning each data element xi in the data set X with a random score si based on the weight wi associated with the data element xi; and identifying the m data elements in the data set X, wherein the m data elements have assigned scores that are higher than scores assigned to remaining data elements of the data set X. |
地址 |
Menlo Park CA US |