发明名称 Categorizing objects, such as documents and/or clusters, with respect to a taxonomy and data structures derived from such categorization
摘要 A Website may be automatically categorized by (a) accepting Website information, (b) determining a set of scored clusters (e.g., semantic, term co-occurrence, etc.) for the Website using the Website information, and (c) determining at least one category (e.g., a vertical category) of a predefined taxonomy using at least some of the set of clusters.
申请公布号 US8918395(B2) 申请公布日期 2014.12.23
申请号 US201213528197 申请日期 2012.06.20
申请人 Google Inc. 发明人 Gehrking David;Law Ching;Maxwell Andrew
分类号 G06F7/00;G06F17/30 主分类号 G06F7/00
代理机构 Foley and Lardner LLP 代理人 Foley and Lardner LLP ;Lanza John D.
主权项 1. A computer-implemented method to associate a semantic cluster with one or more categories of a predefined taxonomy, the method comprising: a) accepting, by a computer system including at least one computer, a plurality of semantic clusters of re-occurring terms within a document, and having a frequency based on the reoccurrence of the term; b) identifying, by the computer system based on the accepted clusters of re-occurring terms within the document, one or more concepts for the document, each concept identifying different re-occurring terms having identical meanings; c) scoring, by the computer system, the identified one or more concepts, the score of each of the one or more concepts weighted by cluster frequency of each of the re-occurring terms identified by said concept; d) identifying, by the computer system, a set of one or more categories using at least some of the one or more scored concepts to look up one or more categories in a concept-category index, wherein a category corresponds to a node of the predefined taxonomy, which defines a structured set of categories; and e) associating, by the computer system, at least some of the one or more categories with the semantic cluster.
地址 Mountain View CA US