Title Method and system for finding similar records in mixed free-text and structured data
Abstract A data mining technique for data sets that contain both structured and unstructured (free-text) data. The invention combines the information available from the different types of data into a single similarity score that indicates the degree of similarity between two records. A data evaluation application selects two records from a database and compares their corresponding fields. For each pair of corresponding fields, the application applies a nominal matching process, an ordinal matching process, or a vector-space matching process, depending on the type of data in that pair. The application then sums the matching scores for all fields in the records to compute the similarity score.
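The per-field dispatch and summation described in the abstract can be illustrated with a minimal Python sketch. The abstract does not specify the individual matching formulas, so everything here is an assumption made for illustration: exact equality for nominal fields, range-scaled distance for ordinal fields, and cosine similarity over bag-of-words counts for free-text fields. The function names, the `schema` layout, and the sample field names are hypothetical and are not taken from the patent.

```python
import math
from collections import Counter


def nominal_match(a, b):
    """Assumed nominal rule: 1.0 if the categorical values are equal, else 0.0."""
    return 1.0 if a == b else 0.0


def ordinal_match(a, b, lo, hi):
    """Assumed ordinal rule: closeness of two ordered values, scaled by the field range."""
    if hi == lo:
        return 1.0
    return 1.0 - abs(a - b) / (hi - lo)


def vector_space_match(text_a, text_b):
    """Assumed vector-space rule: cosine similarity of term-count vectors."""
    va = Counter(text_a.lower().split())
    vb = Counter(text_b.lower().split())
    dot = sum(va[term] * vb[term] for term in va)
    norm_a = math.sqrt(sum(c * c for c in va.values()))
    norm_b = math.sqrt(sum(c * c for c in vb.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def similarity(rec_a, rec_b, schema):
    """Sum the per-field matching scores, choosing the matcher by field type."""
    total = 0.0
    for field, spec in schema.items():
        a, b = rec_a[field], rec_b[field]
        kind = spec["kind"]
        if kind == "nominal":
            total += nominal_match(a, b)
        elif kind == "ordinal":
            total += ordinal_match(a, b, spec["lo"], spec["hi"])
        elif kind == "text":
            total += vector_space_match(a, b)
        else:
            raise ValueError(f"unknown field kind: {kind!r}")
    return total


# Hypothetical schema and records, for illustration only.
schema = {
    "category": {"kind": "nominal"},
    "severity": {"kind": "ordinal", "lo": 1, "hi": 5},
    "notes": {"kind": "text"},
}
record_1 = {"category": "engine", "severity": 2, "notes": "engine failure on takeoff"}
record_2 = {"category": "engine", "severity": 4, "notes": "engine failure on landing"}

print(similarity(record_1, record_2, schema))
```

Because the scores are summed without weights or normalization, the maximum possible value in this sketch equals the number of fields; a real system might weight fields differently, which the abstract does not address.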
Publication number US7440946(B2) Publication date 2008.10.21
Application number US20060331934 Filing date 2006.01.13
Applicant THE MITRE CORPORATION Inventor BLOEDORN ERIC
Classification G06F7/00;G06F17/30 Main classification G06F7/00
Agency Agent
Main claim
Address