发明名称 Using a bloom filter in a web analytics application
摘要 Embodiments of the present invention are directed to the use of a Bloom Filter in a web analytics application. Large sets of data, obtained from on and offline sources, may be analyzed using a Bloom Filter to find desired patterns within the data. The use of a Bloom Filter in the analysis allows for a more efficient use of resources to perform the analysis.
申请公布号 US9256686(B2) 申请公布日期 2016.02.09
申请号 US200912558956 申请日期 2009.09.14
申请人 International Business Machines Corporation 发明人 Woodruff Travis
分类号 G06F17/30;H04L29/08 主分类号 G06F17/30
代理机构 Winstead, P.C. 代理人 Winstead, P.C.
主权项 1. A non-transitory computer readable medium embodying a set of instructions that when executed by a processor cause the processor to perform the steps comprising: identifying a first set of data elements that satisfy a first criteria; entering identification information for each element of the first set of data elements into a bloom filter; applying the bloom filter to a set of user activities; and storing each element from the set of user activities that is included in the bloom filter in a results set; wherein the first set of data elements comprises a user segment, a user segment including a set of user identifiers that identify specific users; wherein the step of entering identification information for each element of the first set of data elements into a bloom filter further comprises: processing each user identifier of the set of user identifiers with at least one hash function, said hash function mapping the user identifier to a bit osition within the bloom filter; andsetting the bit position identified by the at least one hash function to a value indicating true; wherein the at least one hash function further comprises: generating a 32 bit hash value of the user identifier; calculating the modulus of the 32 bit hash value with a size of the bloom filter, a result of the calculation indicating the bit position within the bloom filter that should be set to true; and setting the bit position of the bloom filter corresponding to the result to true.
地址 Armonk NY US