发明名称 System And Method For Identifying Relevant Search Results Via An Index
摘要 A system and method for identifying relevant search results via an index is provided. A search query is received. A semantic representation of query substructures and a list of key terms is generated for the query. Each key term includes a term in the search query or a term related to the query. An inverted index having key terms each associated with a semantic representation and a link to a source reference is accessed. The inverted index is queried using a subset of key terms. A result set for the subset key terms is identified within the inverted index. Each result is scored and a subset of the result set is identified as retrieval candidates based on the scoring. One or more of the retrieval candidates are selected based on a comparison of the query semantic representation with the semantic representations for the retrieval candidates.
申请公布号 US2016196340(A1) 申请公布日期 2016.07.07
申请号 US201615069924 申请日期 2016.03.14
申请人 Palo Alto Research Center Incorporated 发明人 Cheslow Robert D.
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A system for identifying relevant search results via an index, comprising: a query module to receive a search query and to generate a semantic representation for the query comprising a plurality of substructures as a semantic analysis of the search query and a list of key terms, wherein the key terms each comprise a term in the search query or a term related to one of the terms in the search query; an inverted index comprising a set of key terms each associated with a semantic representation and a link to a source reference; a candidate module to identify retrieval candidates for comparing the associated semantic representations with the semantic representation of the search query, comprising: a term selection module to select a subset of the key terms from the search query and to query the inverted index with the key terms in the subset;a result module to identify within the inverted index a result set for each of the key terms in the subset; anda scoring module to score each of the results in the set and to identify a subset of the result sets as the retrieval candidates based on the scoring; and a candidate selection module to select one or more of the retrieval candidates based on a comparison of the semantic representation of the search query with the semantic representations for each of the retrieval candidates.
地址 Palo Alto CA US