发明名称 Scalable Parallel User Clustering in Discrete Time Window
摘要 Described is an internet user clustering technology, such as useful in behavioral targeting, in which users are clustered together based on MinHash computations that produce signatures corresponding to users' internet-related activities. In one aspect, users are clustered together based on commonality of signatures between each set of signatures associated with each user. The signature sets and/or clusters may be associated with timestamps, whereby clusters may be determined for a given discrete time window or set of discrete time windows. To facilitate efficient processing, existing, prior signature sets of a user may be incrementally updated (e.g., daily), and/or the MinHash computations for users are partitioned among parallel computing machines. The timestamps may be used to selectively determine a cluster within a continuous time, a time window or set of time windows.
申请公布号 US2010169258(A1) 申请公布日期 2010.07.01
申请号 US20080346881 申请日期 2008.12.31
申请人 MICROSOFT CORPORATION 发明人 YAN JUN;LIU NING;JI LEI;CHEN ZHENG
分类号 G06N5/02;G06F7/06;G06F17/30 主分类号 G06N5/02
代理机构 代理人
主权项
地址