发明名称 Method and apparatus for calculating similarity among documents
摘要 Information that individual elements (characteristic character strings) indicative of characteristics of a registered document appear in the registered document is stored in advance. When calculating similarity of the registered document, a query designated by a searcher is analyzed. The query is represented by a characteristic vector having the individual elements which take the relation between a plurality of words into consideration. Pieces of appearance information of the individual words contained in the query are counted. The counted appearance information is compared with a searching index to calculate similarity between documents.
申请公布号 US7440938(B2) 申请公布日期 2008.10.21
申请号 US20040838231 申请日期 2004.05.05
申请人 HITACHI, LTD. 发明人 MATSUBAYASHI TADATAKA;SUGAYA NATSUKO;IIJIMA MICHIO;OGAWA YUICHI;WATANABE YUUKI;YAMAMOTO SHINYA;SUDOU TSUYOSHI
分类号 G06F7/00;G06F17/30 主分类号 G06F7/00
代理机构 代理人
主权项
地址