发明名称 Methods and systems for organizing content
摘要 A computer-implemented method executes instructions stored on a computer-readable medium. The method includes accessing a hierarchy of clusters, wherein each cluster includes at least one content file, and a label is associated with each cluster. The method further includes calculating a topic purity score for each cluster, and selecting a first cluster and a second cluster from the hierarchy of clusters, wherein the topic purity score of the first cluster and the second cluster are less than a purity threshold. The method also includes creating a third cluster by combining the content files included within the first cluster and the second cluster, determining a parent category of the first cluster and the second cluster, wherein the parent category is at a level within the hierarchy higher than a level of the first cluster and the second cluster, and associating a label of the parent category with the third cluster.
申请公布号 US8972404(B1) 申请公布日期 2015.03.03
申请号 US201213531081 申请日期 2012.06.22
申请人 Google Inc. 发明人 Lewis Glenn M.;Buryak Kirill;Ben-Artzi Aner;Peng Jun;Benbarak Nadav
分类号 G06F17/30 主分类号 G06F17/30
代理机构 Armstrong Teasdale LLP 代理人 Armstrong Teasdale LLP
主权项 1. A computer implemented method including executing instructions stored on a computer-readable medium, said method comprising: accessing a hierarchy of clusters, wherein each cluster includes at least one content file, and wherein a label is associated with each cluster; calculating a topic purity score for each cluster; selecting a first cluster and a second cluster from the hierarchy of clusters, wherein the topic purity score of the first cluster and the topic purity score of the second cluster are less than a purity threshold; creating a third cluster by combining the content files included within the first cluster and the second cluster; determining a parent category of the first cluster and the second cluster, wherein the parent category is at a level within the hierarchy higher than a level of the first cluster and the second cluster; determining whether an action is associated with the parent category, the action providing a response to a topic of the content file; associating a label of the parent category with the third cluster if an action is associated with the parent category such that the label of the first cluster and the label of the second cluster are replaced with the label of the parent category; and retaining the labels of the first cluster and the second cluster if no action is associated with the parent category.
地址 Mountain View CA US