Title of invention: Method of generating a distributed text index for parallel query processing
Abstract: The present invention relates to a method of generating a distributed text index for parallel query processing by a number of nodes. A set of node indices is generated for text indexing a set of documents, each node text index covering a subset of the documents. For each node text index, a local frequency measure for each term of the node text index is calculated on the basis of a frequency of documents containing the term in the subset of the documents of the node. A global frequency measure for each term is calculated on the basis of a frequency of documents containing the term in the set of documents. A quality measure for each node text index is calculated on the basis of the local frequency measures of the terms of the node and the global frequency measure of the terms of the node.
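The three measures named in the abstract can be illustrated with a minimal sketch. The abstract does not state how the local and global frequency measures are combined, so the formula below (one minus the mean absolute difference between local and global document frequencies) is purely an assumption for illustration, as are the function names `document_frequencies` and `node_quality` and the whitespace tokenization.

```python
from collections import Counter


def document_frequencies(docs):
    """Return, for each term, the fraction of documents in `docs` containing it."""
    if not docs:
        return {}
    counts = Counter()
    for doc in docs:
        # Count each term at most once per document (document frequency).
        counts.update(set(doc.split()))
    return {term: count / len(docs) for term, count in counts.items()}


def node_quality(node_docs, all_docs):
    """Hypothetical quality measure for one node text index.

    Assumption: quality is 1 minus the mean absolute difference between
    the local and global document frequencies of the node's terms, so a
    node whose subset mirrors the whole collection scores close to 1.
    """
    local = document_frequencies(node_docs)
    if not local:
        return 0.0
    global_freq = document_frequencies(all_docs)
    deviation = sum(abs(local[t] - global_freq[t]) for t in local) / len(local)
    return 1.0 - deviation


# Two nodes, each indexing a subset of a four-document collection.
nodes = [
    ["apple banana", "apple cherry"],
    ["banana cherry", "apple banana"],
]
all_docs = [doc for node in nodes for doc in node]
qualities = [node_quality(node, all_docs) for node in nodes]
```

Under this assumed formula, a low score would flag a node whose term distribution differs strongly from the collection as a whole, which is one plausible use of a per-node quality measure when documents are distributed for parallel querying.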
Publication number: US7324988(B2)  Publication date: 2008.01.29
Application number: US20040805402  Filing date: 2004.03.19
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION  Inventors: ALTEVOGT PETER; NITZSCHE RAIKO
Classification: G06F17/30  Main classification: G06F17/30
Agency:  Agent:
Main claim:
Address: