Title of invention: Segmenting text for searching
Abstract: Methods, systems, and apparatus, including computer program products, for segmenting text for searching are disclosed. In one implementation, a method is provided. The method includes receiving text; segmenting the text into one or more unigrams; filtering the one or more unigrams to identify one or more core unigrams; and generating a searchable resource, including: for each of the one or more core unigrams: identifying a stem, indexing the stem, and associating one or more second n-grams with the indexed stem. Each of the one or more second n-grams is derived from the text and includes a core unigram that is related to the indexed stem.
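The steps named in the abstract (segment, filter to core unigrams, stem, index, associate n-grams) can be illustrated with a minimal sketch. It is not the patented implementation. The whitespace tokenizer, the `STOP_WORDS` list, the `naive_stem` suffix stripper, and the choice of bigrams as the associated n-grams are all assumptions made for illustration; the abstract does not specify any of them.

```python
from collections import defaultdict

# Hypothetical stop-word list: the abstract does not say how core unigrams are chosen.
STOP_WORDS = {"a", "an", "the", "for", "of", "in", "to", "and"}


def segment(text):
    """Split text into lowercase unigrams on whitespace (an assumed tokenizer)."""
    return text.lower().split()


def naive_stem(word):
    """Toy suffix stripper, standing in for whatever stemmer is actually used."""
    for suffix in ("ing", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def build_index(text, n=2):
    """Map each core unigram's stem to the n-grams of the text that contain it."""
    unigrams = segment(text)
    index = defaultdict(set)
    for i, word in enumerate(unigrams):
        if word in STOP_WORDS:
            continue  # filtered out: not a core unigram
        stem = naive_stem(word)
        # Visit every n-word window of the text that covers position i.
        first = max(0, i - n + 1)
        last = min(i, len(unigrams) - n)
        for start in range(first, last + 1):
            index[stem].add(" ".join(unigrams[start:start + n]))
    return dict(index)
```

Under these assumptions, `build_index("segmenting text for searching")` files the bigram "segmenting text" under the stem "segment" and "for searching" under the stem "search", while "for" itself is never indexed.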
Publication number: US8423350(B1) Publication date: 2013.04.16
Application number: US20090470394 Filing date: 2009.05.21
Applicant: CHANDRA SUNIL;CHOPRA HARSHIT;SHANMUGAM SIDDAARTH;GOOGLE INC. Inventor: CHANDRA SUNIL;CHOPRA HARSHIT;SHANMUGAM SIDDAARTH
Classification: G06F17/27 Main classification: G06F17/27
Patent agency: Agent:
Main claim:
Address: