Title of invention: COMPUTER AIDED DOCUMENT RETRIEVAL
Abstract: A method of determining cluster attractors for a plurality of documents, each comprising at least one term. The method comprises calculating, in respect of each term, a probability distribution indicative of the frequency of occurrence of the, or each, other term that co-occurs with said term in at least one of said documents. The entropy of each probability distribution is then calculated. Finally, at least one of said probability distributions is selected as a cluster attractor depending on its entropy value. The method allows very small clusters to be formed, enabling more focused retrieval during a document search.
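The three steps named in the abstract (co-occurrence distribution per term, entropy per distribution, selection by entropy) can be illustrated with a minimal Python sketch. This is not the patented implementation. It assumes that documents are already tokenized into lists of terms, that co-occurrence is counted at whole-document level, and that the lowest-entropy distributions are the ones selected; the abstract only says selection depends on the entropy value. The function names and the parameter `k` are hypothetical.

```python
import math
from collections import Counter


def cooccurrence_distributions(docs):
    """For each term, estimate the distribution of the other terms that
    co-occur with it, pooled over every document containing that term."""
    counts = {}
    for doc in docs:
        tf = Counter(doc)  # term frequencies within this document
        for term in tf:
            ctx = counts.setdefault(term, Counter())
            for other, n in tf.items():
                if other != term:
                    ctx[other] += n
    dists = {}
    for term, ctx in counts.items():
        total = sum(ctx.values())
        if total:  # skip terms that never co-occur with another term
            dists[term] = {w: n / total for w, n in ctx.items()}
    return dists


def entropy(dist):
    """Shannon entropy (in bits) of a discrete probability distribution."""
    return -sum(p * math.log2(p) for p in dist.values() if p > 0)


def select_attractors(docs, k):
    """Assumed selection rule: keep the k lowest-entropy distributions,
    breaking ties alphabetically by term so the result is deterministic."""
    dists = cooccurrence_distributions(docs)
    ranked = sorted(dists, key=lambda t: (entropy(dists[t]), t))
    return {t: dists[t] for t in ranked[:k]}
```

Under these assumptions, a term whose context is concentrated on a few co-occurring terms has low entropy and is treated as a narrow, focused attractor, which is consistent with the abstract's stated aim of forming very small clusters. A real system would likely also filter by document frequency, which the abstract does not specify.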
Publication number: CA2540241(C) | Publication date: 2013.09.17
Application number: CA20042540241 | Filing date: 2004.09.27
Applicant: UNIVERSITY OF ULSTER; ST. PETERSBURG STATE UNIVERSITY | Inventor: PATTERSON, DAVID; DOBRYNIN, VLADIMIR
Classification: G06F17/30; G06K9/62 | Main classification: G06F17/30
Agency: | Agent:
Principal claim:
Address: