发明名称 Efficient Retrieval Algorithm by Query Term Discrimination
摘要 An exemplary method for use in information retrieval includes, for each of a plurality of terms, selecting a predetermined number of top scoring documents for the term to form a corresponding document set for the term; receiving a plurality of terms, optionally as a query; ranking the plurality of terms for importance based at least in part on the document sets for the plurality of terms where the ranking comprises using an inverse document frequency algorithm; selecting a number of ranked terms based on importance where each selected, ranked term comprises its corresponding document set wherein each document in a respective document set comprises a document identification number; forming a union set based on the document sets associated with the selected number of ranked terms; and, for a document identification number in the union set, scanning a document set corresponding to an unselected term for a matching document identification number. Various other exemplary systems, methods, devices, etc. are also disclosed.
申请公布号 US2008215574(A1) 申请公布日期 2008.09.04
申请号 US20080038652 申请日期 2008.02.27
申请人 MICROSOFT CORPORATION 发明人 LIN CHENXI;JI LEI;ZENG HUAJUN;ZHANG BENYU;CHEN ZHENG;WANG JIAN
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址