发明名称 SIMILAR DOCUMENT RETRIEVAL DEVICE, AND SIMILAR DOCUMENT RETRIEVAL METHOD AND PROGRAM
摘要 PROBLEM TO BE SOLVED: To provide a similar document retrieval device and a similar document retrieval method and program capable of discovering a document where not only input retrieval character strings coincide but similar tags are attached or a document where a similar keyword is used in retrieving a similar document from a large amount of documents stored in a text format. SOLUTION: In a similar document retrieval device for retrieving documents, with respect to a normalized document to be a document which normalizes words included in an input document, stores words for which a tag is attached to the document, and normalizes words in the document, a tag is generated in the document according to a setting file, an index where a word to be a clue for retrieving a similar document matches a document ID is generated, statistical information to be statistical information on names of attributes included in the document stored in the storage means is generated, and the document is retrieved on the basis of the statistical information. COPYRIGHT: (C)2009,JPO&INPIT
申请公布号 JP2009104475(A) 申请公布日期 2009.05.14
申请号 JP20070276794 申请日期 2007.10.24
申请人 NIPPON TELEGR & TELEPH CORP <NTT> 发明人 IZAWA MINAKO;KONO TAKASHI;NAKAWATASE SHUICHI
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址