<p>Systems and methods for clustering a group of data points based on a measure of similarity between each pair of data points in the group are provided. A pairwise similarity function can be estimated for each pair of data points in the group. A clustering algorithm can be executed to create clusters and associate data points with the clusters using the pairwise similarity function. The algorithm can be iterated multiple times until a stopping condition is reached in order to reduce variance in the output of the algorithm. The pairwise similarity function for each pair of data points can be updated between iterations of the algorithm and the results of each iteration can be aggregated. The data in each data point associated with a cluster can be consolidated into a consolidated data point.</p>
申请公布号
WO2010054349(A2)
申请公布日期
2010.05.14
申请号
WO2009US63794
申请日期
2009.11.10
申请人
GOOGLE INC.;AILON, NIR;LIBERTY, EDO;KHALSA, HARISHABD