发明名称 SELECTION OF ATOMS FOR SEARCH ENGINE RETRIEVAL
摘要 Methods are provided for populating search indexes with atoms identified in documents. Documents that are to be indexed are identified, and for each document, atoms are identified and are categorized as unigrams, n-grams, and n-tuples. A list of atom/document pairs is generated such that an information metric can be computed for each pair. An information metric represents a ranking of the atom in relation to the particular document. Based on the information metric, some atom/document pairs are discarded and others are indexed.
申请公布号 US2012130981(A1) 申请公布日期 2012.05.24
申请号 US201113045278 申请日期 2011.03.10
申请人 RISVIK KNUT MAGNE;HOPCROFT MIKE;BENNETT JOHN G.;KALYANARAMAN KARTHIK;CHILIMBI TRISHUL;MICROSOFT CORPORATION 发明人 RISVIK KNUT MAGNE;HOPCROFT MIKE;BENNETT JOHN G.;KALYANARAMAN KARTHIK;CHILIMBI TRISHUL
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址