发明名称 |
Using a bloom filter in a web analytics application |
摘要 |
Embodiments of the present invention are directed to the use of a Bloom Filter in a web analytics application. Large sets of data, obtained from on and offline sources, may be analyzed using a Bloom Filter to find desired patterns within the data. The use of a Bloom Filter in the analysis allows for a more efficient use of resources to perform the analysis. |
申请公布号 |
US9256686(B2) |
申请公布日期 |
2016.02.09 |
申请号 |
US200912558956 |
申请日期 |
2009.09.14 |
申请人 |
International Business Machines Corporation |
发明人 |
Woodruff Travis |
分类号 |
G06F17/30;H04L29/08 |
主分类号 |
G06F17/30 |
代理机构 |
Winstead, P.C. |
代理人 |
Winstead, P.C. |
主权项 |
1. A non-transitory computer readable medium embodying a set of instructions that when executed by a processor cause the processor to perform the steps comprising:
identifying a first set of data elements that satisfy a first criteria; entering identification information for each element of the first set of data elements into a bloom filter; applying the bloom filter to a set of user activities; and storing each element from the set of user activities that is included in the bloom filter in a results set; wherein the first set of data elements comprises a user segment, a user segment including a set of user identifiers that identify specific users; wherein the step of entering identification information for each element of the first set of data elements into a bloom filter further comprises:
processing each user identifier of the set of user identifiers with at least one hash function, said hash function mapping the user identifier to a bit osition within the bloom filter; andsetting the bit position identified by the at least one hash function to a value indicating true; wherein the at least one hash function further comprises: generating a 32 bit hash value of the user identifier; calculating the modulus of the 32 bit hash value with a size of the bloom filter, a result of the calculation indicating the bit position within the bloom filter that should be set to true; and setting the bit position of the bloom filter corresponding to the result to true. |
地址 |
Armonk NY US |