发明名称 Document Comparison Using Multiple Similarity Measures
摘要 Disclosed herein is a method for comparing documents. The method includes the steps of: determining a plurality of similarity measures; and determining an overall similarity measure for the plurality of documents, based on the plurality of similarity measures. In one embodiment, the similarity measures are chosen from the group of similarity measures consisting of semantic and reference similarity measures. When comparing documents from the chemical, biochemical or pharmaceutical domains, the determination of the similarity utilizes a determination of structural similarity of the chemical formulas described in the plurality of documents.
申请公布号 US2009037389(A1) 申请公布日期 2009.02.05
申请号 US20080193803 申请日期 2008.08.19
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 KOTHARI RAVI;MUKHERJEA SOUGATA
分类号 G06F7/06;G06F17/30 主分类号 G06F7/06
代理机构 代理人
主权项
地址