Title of invention DOCUMENT SIMILARITY COMPUTING DEVICE, CLUSTERING DEVICE AND DOCUMENT EXTRACTION DEVICE
Abstract PROBLEM TO BE SOLVED: To perform clustering and document extraction efficiently by computing, with high accuracy, a document similarity that can be used as an absolute value and does not depend on document size. SOLUTION: This document similarity computing device is provided with an input part 11 for inputting a document set, and a normalization part 14 that, for each of a plurality of document combinations in the input document set, computes a similarity serving as a relative value between the documents by the tf-idf method, using document vectors and the importance of the words contained in the documents, and converts each similarity into an absolute value by normalization. COPYRIGHT: (C)2003,JPO
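The abstract does not say which normalization is applied, so the following is only a minimal sketch under stated assumptions, not the patented method. It assumes the relative similarity is the inner product of tf-idf vectors (which grows with document length) and that normalization divides by the geometric mean of the two self-similarities, which is equivalent to cosine similarity. The function names `tfidf_vectors`, `raw_similarity` and `normalized_similarity`, the smoothed idf formula, and the sample documents are all hypothetical.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Build a sparse tf-idf weight dict for each tokenized document."""
    n = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(term for doc in docs for term in set(doc))
    # Smoothed idf (an assumption) so very common terms keep a positive weight.
    idf = {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}
    return [{t: tf * idf[t] for t, tf in Counter(doc).items()} for doc in docs]


def raw_similarity(u, v):
    """Inner product of two sparse vectors: a relative, size-dependent score."""
    if len(u) > len(v):
        u, v = v, u
    return sum(w * v.get(t, 0.0) for t, w in u.items())


def normalized_similarity(u, v):
    """Assumed normalization: divide by the geometric mean of self-similarities."""
    denom = math.sqrt(raw_similarity(u, u) * raw_similarity(v, v))
    return raw_similarity(u, v) / denom if denom else 0.0


# Hypothetical sample data: the second document is the first repeated twice.
docs = [
    "document similarity clustering".split(),
    "document similarity clustering document similarity clustering".split(),
    "patent filing date".split(),
]
vecs = tfidf_vectors(docs)
```

In this sketch, doubling a document doubles its raw score against the original, while the normalized score stays at the top of a fixed range, so a single threshold can be applied to documents of different sizes.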
Publication number JP2003263443(A) Publication date 2003.09.19
Application number JP20020062239 Filing date 2002.03.07
Applicant FUJITSU LTD Inventor NANBA ISAO
Classification G06F17/30;(IPC1-7):G06F17/30 Main classification G06F17/30
Agency Agent
Principal claim
Address