Title of invention |
Categorizing objects, such as documents and/or clusters, with respect to a taxonomy and data structures derived from such categorization |
Abstract |
A Website may be automatically categorized by (a) accepting Website information, (b) determining a set of scored clusters (e.g., semantic, term co-occurrence, etc.) for the Website using the Website information, and (c) determining at least one category (e.g., a vertical category) of a predefined taxonomy using at least some of the set of clusters. |
Publication number |
US8918395(B2) |
Publication date |
2014.12.23 |
Application number |
US201213528197 |
Filing date |
2012.06.20 |
Applicant |
Google Inc. |
Inventors |
Gehrking David;Law Ching;Maxwell Andrew |
Classification |
G06F7/00;G06F17/30 |
Main classification |
G06F7/00 |
Agency |
Foley and Lardner LLP |
Agent |
Foley and Lardner LLP;Lanza John D. |
Main claim |
1. A computer-implemented method to associate a semantic cluster with one or more categories of a predefined taxonomy, the method comprising:
a) accepting, by a computer system including at least one computer, a plurality of semantic clusters of re-occurring terms within a document, and having a frequency based on the reoccurrence of the term;
b) identifying, by the computer system based on the accepted clusters of re-occurring terms within the document, one or more concepts for the document, each concept identifying different re-occurring terms having identical meanings;
c) scoring, by the computer system, the identified one or more concepts, the score of each of the one or more concepts weighted by cluster frequency of each of the re-occurring terms identified by said concept;
d) identifying, by the computer system, a set of one or more categories using at least some of the one or more scored concepts to look up one or more categories in a concept-category index, wherein a category corresponds to a node of the predefined taxonomy, which defines a structured set of categories; and
e) associating, by the computer system, at least some of the one or more categories with the semantic cluster. |
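Steps a) through e) of the claim can be illustrated with a minimal sketch. It is not the patented implementation: the term-to-concept map, the concept-category index contents, the category paths, the additive scoring rule, and the names `score_concepts` and `categorize` are all hypothetical, chosen only to show how frequency-weighted concept scores might drive a lookup in a concept-category index.

```python
from collections import defaultdict

# Hypothetical mapping: different terms with the same meaning share one concept.
TERM_TO_CONCEPT = {
    "car": "automobile",
    "auto": "automobile",
    "loan": "lending",
    "financing": "lending",
}

# Hypothetical concept-category index: concept -> nodes of a predefined taxonomy.
CONCEPT_CATEGORY_INDEX = {
    "automobile": ["/Autos & Vehicles"],
    "lending": ["/Finance/Credit & Lending"],
}


def score_concepts(clusters):
    """Steps a) to c): accept clusters (term -> frequency), map terms to
    concepts, and score each concept by summing the term frequencies.
    The additive weighting is an assumption made for illustration."""
    scores = defaultdict(float)
    for cluster in clusters:
        for term, frequency in cluster.items():
            concept = TERM_TO_CONCEPT.get(term)
            if concept is not None:
                scores[concept] += frequency
    return dict(scores)


def categorize(clusters, min_score=0.0):
    """Steps d) and e): look up categories for sufficiently scored concepts
    and return (category, score) pairs, highest score first."""
    category_scores = defaultdict(float)
    for concept, score in score_concepts(clusters).items():
        if score < min_score:
            continue  # use only "at least some" of the scored concepts
        for category in CONCEPT_CATEGORY_INDEX.get(concept, []):
            category_scores[category] += score
    # Sort by descending score, then by name for a deterministic order.
    return sorted(category_scores.items(), key=lambda kv: (-kv[1], kv[0]))


if __name__ == "__main__":
    example_clusters = [{"car": 3, "auto": 2}, {"loan": 1, "car": 1}]
    for category, score in categorize(example_clusters):
        print(category, score)
```

In this sketch the `min_score` threshold stands in for the claim's "at least some of the one or more scored concepts"; a real system could select concepts by rank or by another criterion.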
地址 |
Mountain View CA US |