Title: Posting questions from search queries
Abstract: The present disclosure is directed to a system and method for posting questions from search queries. In some implementations, a method includes identifying a plurality of different questions previously searched. The previously-searched questions each include a word indicating a question. The previously-searched questions are filtered to remove one or more specified words included with the question word. At least a subset of the plurality of previously-searched questions that can be used to generate a canonical form after removing the one or more specified words is determined. The subset of previously-searched questions is ranked based, at least in part, on a frequency of submission of each previously-submitted search query in the subset. A particular one of the previously-submitted search queries in the subset is identified as representative of the subset of previously-submitted search queries based on the ranking.
Publication number: US8768920(B1) Publication date: 2014.07.01
Application number: US201213366984 Filing date: 2012.02.06
Applicant: Google Inc. Inventors: Coladonato Greg; Ke Huacheng
Classification: G06F7/00; G06F17/30; G06F15/16 Main classification: G06F7/00
Agency: Fish & Richardson P.C. Agent: Fish & Richardson P.C.
Independent claim: 1. A method, comprising:
identifying search queries that each include a question word of a plurality of predetermined question words;
mapping each of the search queries to a corresponding canonical form, including applying mappings defined in an evaluation file to the search queries, the defined mappings including
    filtering that removes from the search queries any predetermined non-question words occurring in the search queries, the predetermined non-question words being obtained from the evaluation file,
    conjugating any verbs in the search queries to a particular verb tense,
    updating declensions of nouns in the search queries to a particular noun declension, and
    ordering the words remaining in each of the search queries after the filtering, conjugating, and updating in a predefined way, including placing the question word in a predetermined position in the ordering;
identifying a plurality of different search queries that each map to a particular canonical form;
ranking the different search queries based on a frequency of occurrence of each of the different search queries; and
selecting a highest-ranked different query as a representative query for each of the different search queries.
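The steps recited in the claim can be illustrated with a minimal Python sketch. It is not the patented implementation: the word lists `QUESTION_WORDS` and `NON_QUESTION_WORDS`, the toy lookup tables `VERB_BASE` and `NOUN_BASE`, and the function names are all hypothetical stand-ins for the contents of the "evaluation file". Putting the question word first and sorting the remaining words alphabetically is only one assumed reading of ordering "in a predefined way".

```python
from collections import Counter, defaultdict

# Hypothetical stand-ins for the mappings an "evaluation file" might define.
QUESTION_WORDS = {"who", "what", "when", "where", "why", "how"}
NON_QUESTION_WORDS = {"a", "an", "the", "is", "are", "do", "does", "i", "to"}
VERB_BASE = {"tied": "tie", "tying": "tie", "ties": "tie"}  # toy conjugation table
NOUN_BASE = {"shoes": "shoe", "knots": "knot"}              # toy declension table


def canonical_form(query):
    """Map a query to a canonical tuple, or None if it has no question word."""
    words = query.lower().replace("?", " ").split()
    question = next((w for w in words if w in QUESTION_WORDS), None)
    if question is None:
        return None
    # Filter out the predetermined non-question words (and the question word itself).
    rest = [w for w in words if w != question and w not in NON_QUESTION_WORDS]
    # Normalize verbs to one tense and nouns to one declension via the lookups.
    rest = [VERB_BASE.get(w, NOUN_BASE.get(w, w)) for w in rest]
    # Assumed ordering: question word in first position, remaining words sorted.
    return (question, *sorted(rest))


def representative_queries(query_log):
    """Pick the most frequently submitted query for each canonical form."""
    counts = Counter(q.strip().lower() for q in query_log)
    groups = defaultdict(list)
    for query, freq in counts.items():
        form = canonical_form(query)
        if form is not None:
            groups[form].append((freq, query))
    # Rank by descending frequency; break ties alphabetically for determinism.
    return {
        form: min(members, key=lambda m: (-m[0], m[1]))[1]
        for form, members in groups.items()
    }


if __name__ == "__main__":
    log = (
        ["how to tie a shoe"] * 3
        + ["how do i tie shoes?"] * 2
        + ["tying shoes how", "what is the capital of france", "weather today"]
    )
    for form, query in representative_queries(log).items():
        print(form, "->", query)
```

In this sketch the three shoe-tying variants collapse to the same canonical tuple, and the variant submitted most often is chosen as the representative. A production system would presumably rely on real morphological analysis rather than hand-written lookup tables.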
Address: Mountain View CA US