Title of invention CALCULATION METHOD AND DEVICE FOR SIMILARITY OF CHARACTER STRING AND RECORDING MEDIUM
Abstract PROBLEM TO BE SOLVED: To calculate the similarity of character strings with emphasis on words, and to retrieve a document without morphological analysis. SOLUTION: In this method for calculating the similarity of character strings, an input character string and a document from a document database are treated as two character strings, and a similarity calculation part 14 calculates the similarity between them. Within part 14, a coincident character string similarity calculation part 21 calculates a character string score for a partial character string common to both strings and adds this score to the similarity of the remaining partial strings. An optional character string similarity calculation part 22 shifts the correspondence between the two strings to calculate a larger degree of similarity, and a maximum value selection part 23 selects the larger degree of similarity. These processes are repeated to total the scores of the partial character strings that fit the sequences of the two strings, i.e., the partial character strings common to both, and thereby to calculate the final similarity. A retrieval result output part 13 selects a document with a high degree of similarity from the document database as the retrieval result.
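The procedure in the abstract can be read as a dynamic program over positions in the two strings. The sketch below is one possible interpretation, not the patented implementation: the abstract does not state the scoring function, so `substring_score` (length squared, which favors longer common substrings) is a hypothetical choice, and the names `similarity` and `retrieve` are likewise introduced only for illustration.

```python
from typing import Callable, List


def substring_score(s: str) -> float:
    # Hypothetical score: the abstract does not define it. Squaring the
    # length rewards longer common substrings over scattered characters.
    return float(len(s) ** 2)


def similarity(a: str, b: str,
               score: Callable[[str], float] = substring_score) -> float:
    n, m = len(a), len(b)
    # sim[i][j] holds the best total score for the suffixes a[i:] and b[j:].
    sim = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(n - 1, -1, -1):
        for j in range(m - 1, -1, -1):
            # "Optional" branch (part 22 in the abstract): shift the
            # correspondence by skipping one character on either side.
            best = max(sim[i + 1][j], sim[i][j + 1])
            # "Coincident" branch (part 21): score each common substring
            # starting here and add the similarity of what remains.
            k = 0
            while i + k < n and j + k < m and a[i + k] == b[j + k]:
                k += 1
                best = max(best, score(a[i:i + k]) + sim[i + k][j + k])
            # Maximum selection (part 23): keep the larger value.
            sim[i][j] = best
    return sim[0][0]


def retrieve(query: str, documents: List[str]) -> List[str]:
    # Rank documents by similarity to the query, highest first
    # (a stand-in for the retrieval result output part 13).
    return sorted(documents, key=lambda d: similarity(query, d), reverse=True)
```

Under this assumed score, `similarity("abxcd", "abycd")` totals the scores of the common substrings "ab" and "cd", and no word segmentation or morphological analysis is involved at any step. The nested loops make the cost grow quickly with string length, so this form is suited to illustration rather than to large document databases.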
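Publication number JP2001067378(A) Publication date 2001.03.16
Application number JP20000188490 Filing date 2000.06.22
Applicant SUMITOMO ELECTRIC IND LTD Inventor UMEMURA KYOJI
Classification G06F17/30;(IPC1-7):G06F17/30 Main classification G06F17/30
Agency Agent
Principal claim
Address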