发明名称 Adaptive distinct counting for network-traffic monitoring and other applications
摘要 In one embodiment, a counting method of the invention uses an adaptive sketching-update process to compress an unknown cardinality into a counter value that counts the number of binary ones in a hashed bitmap vector. The sketching-update process is probabilistic in nature and uses bit-flip probabilities that are adaptively decreased as the counter value increases. Parameters of the sketching-update process are selected so that the relative error of cardinality estimates obtained based on the counter values is relatively small and substantially constant over a relatively wide range of cardinalities, e.g., from one to about one million. Due to the latter property, the counting method can advantageously be implemented in the form of embedded software that relies on a relatively small, fixed amount of memory.
申请公布号 US8931088(B2) 申请公布日期 2015.01.06
申请号 US201012732293 申请日期 2010.03.26
申请人 Alcatel Lucent 发明人 Chen Aiyou;Cao Jin;Menten Lawrence E.
分类号 G06F11/00;H04L29/06;H04L12/26 主分类号 G06F11/00
代理机构 Mendelsohn, Drucker & Dunleavy, P.C. 代理人 Mendelsohn, Drucker & Dunleavy, P.C. ;Gruzdkov Yuri;Mendelsohn Steve
主权项 1. A device-implemented method of distinct counting, the method comprising: (A) deriving a hash key for an item of a plurality of items; (B) applying a hash function to the hash key to identify a corresponding target bucket in a bitmap, wherein the bitmap has a plurality of buckets, each for storing a value; and (C1) if the target bucket has a first value, then probabilistically changing the first value to a second value, with a probability of the change depending on a total number of second values stored in the bitmap, wherein step (B) comprises: (B1) generating a hash value by applying the hash function to the hash key;(B2) converting the generated hash value into a number that is different from said hash value, wherein said converting includes truncating the generated hash value to a selected number of most significant bits; and(B3) using the number produced at step (B2) to identify the target bucket.
地址 Boulogne-Billancourt FR