Title: Text segmentation
Abstract: Methods and systems for improving text segmentation are disclosed. In one embodiment, at least a first segmented result and a second segmented result are determined from a string of characters, a first frequency of occurrence for the first segmented result and a second frequency of occurrence for the second segmented result are determined, and an operable segmented result is identified from the first segmented result and the second segmented result based at least in part on the first frequency of occurrence and the second frequency of occurrence.
Publication number: US8849852(B2)
Publication date: 2014.09.30
Application number: US201113323664
Filing date: 2011.12.12
Applicant: Google Inc.
Inventors: Elbaz Gilad Israel; Mandelson Jacob L.
Classification: G06F17/30; G06F17/27
Main classification: G06F17/30
Agency: Fish & Richardson P.C.
Agent: Fish & Richardson P.C.
Main claim: 1. A computer-implemented method performed by data processing apparatus comprising: receiving a string of characters; identifying segmented results from the string of characters, wherein an identified segmented result includes one or more words that are formed from segmenting the string of characters into two or more sub-strings; determining levels at which the identified segmented results occur in one or more corpora; selecting one or more segmented results from the identified segmented results based on at least the determined levels; and providing the selected one or more segmented results in association with the string of characters.
Address: Mountain View CA US
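
The steps recited in the main claim (receive a string, identify candidate segmented results, determine how often each occurs in a corpus, select by that level) can be illustrated with a minimal Python sketch. Nothing below comes from the patent itself: the function names, the toy corpus, the use of the corpus tokens as the vocabulary, and the choice of a raw contiguous-phrase count as the "level" are all assumptions made for illustration only.

```python
# Hypothetical toy corpus; a real system would presumably use far larger corpora.
CORPUS = "used rugs for sale cheap used rugs and carpets do not use drugs"


def candidate_segmentations(text, vocabulary):
    """Return every split of `text` into two or more vocabulary words."""
    results = []

    def extend(start, words):
        if start == len(text):
            if len(words) >= 2:  # the claim requires two or more sub-strings
                results.append(tuple(words))
            return
        for end in range(start + 1, len(text) + 1):
            piece = text[start:end]
            if piece in vocabulary:
                extend(end, words + [piece])

    extend(0, [])
    return results


def phrase_frequency(words, corpus_tokens):
    """Count occurrences of `words` as a contiguous run in the token list."""
    n = len(words)
    return sum(
        1
        for i in range(len(corpus_tokens) - n + 1)
        if tuple(corpus_tokens[i:i + n]) == tuple(words)
    )


def select_segmentation(text, corpus):
    """Return (frequency, segmentation) pairs, most frequent first."""
    tokens = corpus.lower().split()
    vocabulary = set(tokens)  # assumption: vocabulary is taken from the corpus
    candidates = candidate_segmentations(text.lower(), vocabulary)
    scored = [(phrase_frequency(c, tokens), c) for c in candidates]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored
```

With the toy corpus above, `select_segmentation("usedrugs", CORPUS)` ranks `("used", "rugs")` (two occurrences) ahead of `("use", "drugs")` (one occurrence). The exhaustive recursion is exponential in the worst case and is meant only to make the candidate-then-score structure visible, not to suggest how a production system would enumerate candidates.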