发明名称 Multifaceted search
摘要 A query is received that includes two or more facets of a multidimensional inverted index for a collection of documents. Each document is associated with at least one facet. Generation of the multidimensional inverted index includes creating one or more entries. Each entry includes a combination of two or more facets and a posting list of indications for the documents associated with respective facets of each entry. Each indication identifies a document. Generation of the index also includes determining documents associated with respective facets of the combination of each entry. The multidimensional inverted index is searched for an entry having the combination of two or more facets included in the query and a search result is returned. An indication for a document may be included in a posting list if it is determined that the document is associated with each facet of the combination of facets of the entry.
申请公布号 US9535979(B2) 申请公布日期 2017.01.03
申请号 US201313923406 申请日期 2013.06.21
申请人 International Business Machines Corporation 发明人 Ambasta Rohan A.;Ganesh Bharath;Gokhale Parag S.;Jain Chandrashekhar
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人 Shatto Robert J.
主权项 1. A computer-implemented method of searching, comprising: receiving a query including two or more facets of a single multidimensional inverted index for a collection of documents, each document being associated with at least one facet, wherein the query further includes one or more non-index terms; wherein the single multidimensional inverted index is generated by: creating one or more entries, each entry including a combination of two or more facets, wherein each of the two or more facets is not a sub-category of any other facet of the two or more facets,determining documents associated with respective facets of the combination of two or more facets of each entry, andcreating a posting list of indications for the documents associated with respective facets of each entry, each indication identifying a document; determining a format of the single multidimensional inverted index, wherein the format includes an order of the two or more facets in the single multidimensional inverted index; normalizing the query to conform to the format of the single multidimensional inverted index, wherein the normalizing includes reordering the two or more facets in the query, the reordered two or more facets having the same order as the two or more facets in the single multidimensional inverted index; searching the single multidimensional inverted index for an entry having the combination of two or more facets included in the query; identifying, based on the searching the single multidimensional inverted index, a set of documents having the combination of two or more facets included in the query; searching the set of documents for the one or more non-index terms; generating a search result, wherein the search result includes only documents of the set of documents that include the one or more non-index terms; and returning the search result.
地址 Armonk NY US