Title of invention: SCALABLE USER CLUSTERING BASED ON SET SIMILARITY
Abstract: Methods and apparatus, including systems and computer program products, to provide clustering of users in which users are each represented as a set of elements representing items, e.g., items selected by users using a system. In one aspect, a program operates to obtain a respective interest set for each of multiple users, each interest set representing items in which the respective user expressed interest; for each of the users, to determine k hash values of the respective interest set, wherein the i-th hash value is a minimum value under a corresponding i-th hash function; and to assign each of the multiple users to each of the respective k clusters established for the respective user, the i-th cluster being represented by the i-th hash value. The assignment of each of the users to k clusters is done without regard to the assignment of any of the other users to k clusters.
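Application publication number: EP1915669(A4)  Application publication date: 2011.01.05
Application number: EP20060801549  Filing date: 2006.08.15
Applicant: GOOGLE INC.  Inventors: DATAR, MAYUR;GARG, ASHUTOSH
Classification: G06F7/00;G06Q30/00  Main classification: G06F7/00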
Agency:  Agent:
Main claim:
Address:
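
The clustering step described in the abstract can be illustrated with a minimal Python sketch. It is not taken from the patent text: the function names (`hash_i`, `minhash_signature`, `assign_clusters`), the use of salted BLAKE2b as the family of k hash functions, and the choice to key each cluster by the pair (i, i-th hash value) are all assumptions made for illustration. The sketch shows only the general min-hash idea: each user's interest set is reduced to k minimum hash values, and each value names one cluster, so every user is processed independently of all others.

```python
import hashlib
from collections import defaultdict


def hash_i(item: str, i: int) -> int:
    """Hypothetical i-th hash function: BLAKE2b salted with the index i."""
    digest = hashlib.blake2b(
        item.encode("utf-8"),
        digest_size=8,
        salt=i.to_bytes(8, "little"),
    ).digest()
    return int.from_bytes(digest, "big")


def minhash_signature(interest_set, k):
    """Return k hash values; the i-th is the minimum of hash_i over the set."""
    if not interest_set:
        raise ValueError("interest set must not be empty")
    return [min(hash_i(item, i) for item in interest_set) for i in range(k)]


def assign_clusters(interest_sets, k):
    """Assign every user to k clusters, one per hash function.

    A cluster is keyed here by (i, i-th hash value), an assumption of this
    sketch. Each user is handled without looking at any other user's
    assignment.
    """
    clusters = defaultdict(set)
    for user, items in interest_sets.items():
        for i, value in enumerate(minhash_signature(items, k)):
            clusters[(i, value)].add(user)
    return clusters


if __name__ == "__main__":
    example = {
        "u1": {"item_a", "item_b", "item_c"},
        "u2": {"item_a", "item_b", "item_c"},
        "u3": {"item_x"},
    }
    for key, members in assign_clusters(example, k=4).items():
        print(key, sorted(members))
```

In this sketch, users with identical interest sets always share all k clusters, and users with overlapping sets tend to share some of them. Because each user's assignment depends only on that user's own set, the loop over users could be split across machines without coordination.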