Title of invention SYSTEM, METHOD AND PROGRAM FOR OPERATING DOCUMENT MATCHING DEGREE
Abstract PROBLEM TO BE SOLVED: To correctly estimate the degree to which a document matches a retrieval word, independently of the type of document. SOLUTION: This system computes a TF term, which reflects the frequency of an input retrieval word in a target document, and an IDF term, which reflects the importance of the retrieval word based on information held in a multi-document information storing means. From the TF term and the IDF term for each retrieval word, it computes a document matching degree indicating how well the target document matches the one or more input retrieval words. The expected appearance frequency of the retrieval word t in the target document d, given that d belongs to the document set σ(t) appropriate to t, is calculated by approximating σ(t) with the document set κ(t) of all documents in which t appears. The difference between this expected value and the frequency with which t actually appears in d is reflected in the TF term. COPYRIGHT: (C)2006, JPO&NCIPI
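The abstract gives no formulas, so the following Python sketch is only one possible reading of the scheme. It assumes that the expected frequency of t in d is the per-token occurrence rate of t within κ(t) scaled by the length of d, that the TF term is the plain difference between actual and expected frequency, that the IDF term is the textbook log(N / |κ(t)|), and that the matching degree is the sum of TF × IDF over the retrieval words. The function name `matching_degree` and all of these formulas are hypothetical and are not taken from the patent.

```python
import math
from collections import Counter


def matching_degree(query_terms, target, corpus):
    """Illustrative matching degree: sum over terms of tf_term * idf_term.

    `target` and every entry of `corpus` are lists of tokens.
    All formulas here are assumptions, not the patented method.
    """
    n_docs = len(corpus)
    target_counts = Counter(target)
    score = 0.0
    for t in query_terms:
        # kappa(t): every corpus document in which t appears at least once,
        # used as a stand-in for the unknown set sigma(t).
        kappa = [doc for doc in corpus if t in doc]
        if not kappa:
            continue  # a term unseen in the corpus contributes nothing
        # Assumed estimator: occurrences of t per token within kappa(t).
        rate = sum(doc.count(t) for doc in kappa) / sum(len(doc) for doc in kappa)
        expected = rate * len(target)      # expected frequency of t in d
        actual = target_counts[t]          # observed frequency of t in d
        tf_term = actual - expected        # assumed form of the TF term
        idf_term = math.log(n_docs / len(kappa))  # textbook IDF, assumed
        score += tf_term * idf_term
    return score


if __name__ == "__main__":
    corpus = [["a", "b", "a"], ["a", "c"], ["b", "c"], ["c", "c"]]
    print(matching_degree(["a"], ["a", "a", "b"], corpus))
```

Under these assumptions, a document that uses the retrieval word more often than a typical document in κ(t) of the same length scores positively, and one that uses it less often scores negatively, which is one way to make the score less sensitive to document length or type.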
Publication number JP2006011851(A) Publication date 2006.01.12
Application number JP20040188434 Filing date 2004.06.25
Applicant OKI ELECTRIC IND CO LTD Inventor HAMAGUCHI YOSHITAKA
Classification G06F17/30;G06F7/00 Main classification G06F17/30
Agency Agent
Main claim
Address