Title of Invention PROBABILISTIC INFORMATION RETRIEVAL NETWORKS
Abstract The frequency of occurrence of a representation in a collection of documents is estimated for document retrieval purposes by identifying the actual frequency of occurrence (actual fi) of the representation in a sample (ni) of documents and calculating the difference between the maximum (fmax) and minimum (fmin) probable frequencies of occurrence of the representation in the collection. If the difference does not exceed a limit, the midpoint of the maximum and minimum probable frequencies (fmean) is taken as the estimated frequency of occurrence of the representation. Document distribution probabilities are optimized and probability thresholds are established for the identification of documents. An initial probability threshold is established and is adjusted as the probabilities are scored for documents in samples. The document result list (170) is iteratively adjusted through the samples.
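The sampling estimate described in the abstract can be illustrated with a minimal sketch. The abstract does not say how fmax and fmin are computed, so this sketch assumes a Wilson score interval on the sampled proportion; the function names, the `limit` default, and the `z` value are hypothetical and are not taken from the patent.

```python
import math

def probable_range(hits, n, z=1.96):
    """Return (f_min, f_max) for the proportion of documents that
    contain the representation. ASSUMPTION: a Wilson score interval;
    the patent abstract does not name the interval it uses."""
    if n == 0:
        return 0.0, 1.0
    p = hits / n  # actual frequency observed in the sample
    denom = 1 + z * z / n
    centre = (p + z * z / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return max(0.0, centre - half), min(1.0, centre + half)

def estimate_frequency(sample_flags, limit=0.05, z=1.96):
    """Grow the sample one document at a time. Once f_max - f_min
    does not exceed `limit`, return the midpoint (f_mean) and the
    sample size used. Return (None, n) if the sample runs out first."""
    hits = 0
    for n, contains in enumerate(sample_flags, start=1):
        hits += 1 if contains else 0
        f_min, f_max = probable_range(hits, n, z)
        if f_max - f_min <= limit:
            return (f_min + f_max) / 2, n
    return None, len(sample_flags)

# Hypothetical sample: every tenth document contains the representation.
flags = [i % 10 == 0 for i in range(2000)]
f_mean, n_used = estimate_frequency(flags)
```

Here the estimate is expressed as a proportion of the collection; multiplying by the collection size would give an estimated document count. A tighter `limit` requires a larger sample before the midpoint is accepted.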
Publication Number WO9423386(A2) Publication Date 1994.10.13
Application Number WO1994US02579 Filing Date 1994.03.10
Applicant WEST PUBLISHING COMPANY Inventors TURTLE, HOWARD, R.;MORTON, GERALD, J.;LARNTZ, F., KINLEY
Classification G06F17/30 Main Classification G06F17/30
Agency Agent
Principal Claim
Address