发明名称 USING HIERARCHICAL RESERVOIR SAMPLING TO COMPUTE PERCENTILES AT SCALE
摘要 In one embodiment, in a hierarchy of nodes, a master node having two or more child nodes obtains from the two or more child nodes two or more sets of data samples or summaries associated therewith, the two or more sets of data samples being representative of traffic processed via two or more sets of servers corresponding to the two or more child nodes, wherein a size of each of the two or more sets of data samples is proportional to an allocation of traffic among the two or more sets of servers corresponding to the two or more child nodes. Each of the two or more sets of data samples is obtained from a different one of the two or more child nodes and represents traffic processed by a corresponding one of the two or more sets of servers. The master node combines the two or more sets of data samples or summaries associated therewith such that a combined set of data is generated. The master node ascertains a numerical value from the combined set of data.
申请公布号 US2016277490(A1) 申请公布日期 2016.09.22
申请号 US201514664043 申请日期 2015.03.20
申请人 Yahoo! Inc. 发明人 Wexler Mike;Ames Robert;Flint Ian
分类号 H04L29/08;H04L12/911;H04L12/26 主分类号 H04L29/08
代理机构 代理人
主权项 1. A method, comprising: at a master node in a hierarchy of nodes, the master node having two or more child nodes, obtaining from the two or more child nodes two or more sets of data samples or summaries associated therewith, the two or more sets of data samples being representative of traffic processed via two or more sets of servers corresponding to the two or more child nodes, wherein a size of each of the two or more sets of data samples is proportional to an allocation of traffic among the two or more sets of servers corresponding to the two or more child nodes, wherein each of the two or more sets of data samples is obtained from a different one of the two or more child nodes and represents traffic processed by a corresponding one of the two or more sets of servers; at the master node, combining the two or more sets of data samples or summaries associated therewith such that a combined set of data is generated; and at the master node, ascertaining a numerical value from the combined set of data.
地址 Sunnyvale CA US