Title of invention: Term-statistics modification for category-based search
Abstract: A method for searching a document collection includes providing an index of terms indicating the documents in which the terms appear. A first statistical distribution of each of at least some of the terms in the index and a second statistical distribution of each of at least some of the categories are estimated over the documents in the collection. A query including one or more of the terms and a category restriction referring to at least one of the categories is accepted. A modified term distribution is produced by operating on the first statistical distribution of at least one of the terms in the query using the second statistical distribution, responsively to the category restriction. The query is applied to the index to return a response, in which occurrences of the at least one of the terms are scored responsively to the modified term distribution.
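The flow described in the abstract can be illustrated with a minimal sketch. The abstract does not say which statistics are used or how they are combined, so everything concrete here is an assumption: the term distribution is taken to be a collection-wide document frequency, the category distribution is taken to be the number of documents per category, and the modification scales the global document frequency by the category's share of the collection before computing an IDF-style weight. All names (DOCS, build_index, modified_idf, search) are hypothetical and are not drawn from the patent.

```python
import math
from collections import defaultdict

# Toy collection: doc id -> (category, text). Purely illustrative data.
DOCS = {
    1: ("sports", "match score goal"),
    2: ("sports", "goal keeper"),
    3: ("finance", "credit score bank"),
    4: ("finance", "bank loan"),
    5: ("finance", "loan rate"),
}

def build_index(docs):
    """Return (term -> doc ids, category -> doc ids)."""
    postings = defaultdict(set)
    cat_docs = defaultdict(set)
    for doc_id, (category, text) in docs.items():
        cat_docs[category].add(doc_id)
        for term in text.split():
            postings[term].add(doc_id)
    return postings, cat_docs

def modified_idf(term_df, total_docs, category_size):
    """Assumed rule: scale the global document frequency by the category's
    share of the collection, clamp it to at least 1, and compute an
    IDF-style weight against the category size instead of the full size."""
    est_df = max(1.0, term_df * category_size / total_docs)
    return math.log(1.0 + category_size / est_df)

def search(query_terms, category, docs):
    """Score documents in `category` by summing modified IDF weights."""
    postings, cat_docs = build_index(docs)
    allowed = cat_docs.get(category, set())
    scores = defaultdict(float)
    for term in query_terms:
        hits = postings.get(term, set()) & allowed
        if not hits:
            continue
        idf = modified_idf(len(postings[term]), len(docs), len(allowed))
        for doc_id in hits:
            scores[doc_id] += idf
    # Highest score first; ties broken by doc id for a stable order.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))

if __name__ == "__main__":
    for doc_id, score in search(["score", "bank"], "finance", DOCS):
        print(doc_id, round(score, 3))
```

Under this assumed rule, a term whose scaled frequency falls below one document is clamped, so its weight is bounded by the size of the restricted sub-collection rather than by the size of the whole collection. Other ways of combining the two distributions would fit the abstract equally well.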
Publication number: US7401073(B2) Publication date: 2008.07.15
Application number: US20050117749 Filing date: 2005.04.28
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION Inventors: CARMEL DAVID;DARLOW ADAM;PETRUSCHKA YAEL;SOFFER AYA
Classification: G06F17/30 Main classification: G06F17/30
Agency: Agent:
Principal claim:
Address: