Title of invention: Method and system for evaluating content via a computer network
Abstract: Systems and methods of evaluating information via a computer network are provided. A content group can be identified, and each item of the content group can be associated with a vector indicating at least one user interest category of users exposed to the item. The vectors of each item can be evaluated to generate a first nearest neighbor list of each item of the content group. The nearest neighbor list of a first item can be compared with the nearest neighbor list of a second item. Based on a result of the comparison, the first and second items can be associated in a cluster.
Publication number: US8745074(B1) Publication date: 2014.06.03
Application number: US201213619467 Filing date: 2012.09.14
Applicant: Google Inc. Inventors: Covell Michele;Baluja Shumeet;Sacks Josh;Wu Yuchen
Classification: G06F17/30;G06F7/00 Main classification: G06F17/30
Agency: Agent:
Main claim: 1. A computer implemented method of evaluating information via a computer network, comprising: identifying, by a data processing system, a first content group comprising one or more content items, each content item of the first content group having an interest category vector indicating at least one user interest category of users exposed to the content item; evaluating, by the data processing system, the interest category vector of a content item of the first content group in conjunction with an interest category vector of a content item of each of a plurality of other content groups to calculate a plurality of first distance metrics, each first distance metric indicating a similarity between the first content group and one of the other content groups; generating, by the data processing system, a first nearest neighbor list of the first content group, the first nearest neighbor list comprising a ranking of the other content groups based on the calculated first distance metrics; comparing the first nearest neighbor list of the first content group with a second nearest neighbor list of a second content group to calculate a second distance metric indicating a similarity between the first nearest neighbor list and the second nearest neighbor list; and based on the calculated second distance metric, combining the first content group with the second content group in a cluster that replaces the first and second content groups.
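The steps recited in the claim can be illustrated with a minimal Python sketch. It is not the patented implementation, and several choices are assumptions made only for illustration: each content group is reduced to a single interest category vector, the first distance metric is cosine distance, the second distance metric is a Jaccard distance over the top-k neighbor lists (each group counted as a member of its own list), and a merged cluster takes the mean of the two vectors. The function names, the group names, the value of k, and the threshold are all hypothetical.

```python
import math

def cosine_distance(u, v):
    """First distance metric (assumed): 1 - cosine similarity of two vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 1.0  # treat an empty vector as maximally distant
    return 1.0 - dot / (norm_u * norm_v)

def nearest_neighbor_list(name, groups, k):
    """Rank the other groups by first distance and keep the k closest."""
    others = [g for g in groups if g != name]
    # Ties are broken by name so the ranking is deterministic.
    others.sort(key=lambda g: (cosine_distance(groups[name], groups[g]), g))
    return others[:k]

def list_distance(name_a, list_a, name_b, list_b):
    """Second distance metric (assumed): Jaccard distance between the
    neighbor lists, with each group added to its own list."""
    set_a = set(list_a) | {name_a}
    set_b = set(list_b) | {name_b}
    return 1.0 - len(set_a & set_b) / len(set_a | set_b)

def merge_if_similar(groups, a, b, k=2, threshold=0.5):
    """Replace groups a and b with one cluster when their neighbor
    lists are close enough; otherwise return the groups unchanged."""
    nn_a = nearest_neighbor_list(a, groups, k)
    nn_b = nearest_neighbor_list(b, groups, k)
    if list_distance(a, nn_a, b, nn_b) > threshold:
        return groups
    merged = {g: v for g, v in groups.items() if g not in (a, b)}
    # Assumed merge rule: the cluster vector is the mean of the two vectors.
    merged[a + "+" + b] = [(x + y) / 2 for x, y in zip(groups[a], groups[b])]
    return merged

# Hypothetical interest category vectors (sports, news, cooking).
groups = {
    "sports_blog": [0.9, 0.1, 0.0],
    "team_news": [0.8, 0.2, 0.0],
    "match_video": [0.7, 0.3, 0.1],
    "recipe_site": [0.0, 0.1, 0.9],
    "baking_video": [0.1, 0.0, 0.8],
}

clustered = merge_if_similar(groups, "sports_blog", "team_news")
print(sorted(clustered))
```

Comparing neighbor lists rather than the raw vectors means two groups are merged only when they sit in the same neighborhood of other groups, which is the two-stage structure the claim describes. Repeating the merge over successive pairs would give an agglomerative clustering, although the sketch stops at a single pairwise merge.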
Address: Mountain View CA US