发明名称 Determining word boundary likelihoods in potentially incomplete text
摘要 Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining word boundary likelihoods in potentially incomplete text. In one aspect, a method includes selecting query sequences from the query, each query sequence being at least a portion of a word n-gram, the word n-gram being a subsequence of up to n words selected from the second sequence of words of the query, and for each query sequence: determining one or more query sequence keys for the query sequence; determining at least one of a word boundary count and a non-word boundary count for each query sequence key, each word-boundary count and non-word boundary count being dependent on the context of the query sequence; and associating, in a data storage device, the at least one word boundary count and non-word boundary counts with each query sequence key.
申请公布号 US9239888(B1) 申请公布日期 2016.01.19
申请号 US201414560091 申请日期 2014.12.04
申请人 Google Inc. 发明人 Das Abhinandan S.;Fung Harry S.
分类号 G06F17/30 主分类号 G06F17/30
代理机构 Fish & Richardson P.C. 代理人 Fish & Richardson P.C.
主权项 1. A method performed by a data processing apparatus, the method comprising: receiving, for a query sequence, a word boundary likelihood that represents a likelihood that the query sequence terminates at a word boundary; determining, based on the word boundary likelihood, a time delay for delaying providing search results for the query sequence; determining that an amount of time since receipt of the query sequence exceeds the time delay, and in response: identifying search results responsive to the query sequence; andproviding the identified search results.
地址 Mountain View CA US