Title of invention AUTOMATICALLY IDENTIFYING MATCHING RECORDS FROM MULTIPLE DATA SOURCES
Abstract A system identifies matching records from two or more different data sources. The system applies a scoring algorithm to identify potential matching pairs of records. A score is provided for each candidate pair of records. Records are pre-filtered based on predefined attributes. The scoring algorithm is applied to the filtered records. A set of potential matches is provided, each with a corresponding score. The set of potential matches is presented in descending score order. A decision may be made for a best match based on the scores.
Publication number US2015095349(A1) Publication date 2015.04.02
Application number US201414160554 Filing date 2014.01.22
Applicant Microsoft Corporation Inventors Sturzoiu Bogdan A.; Ilker M. Cavit
Classification G06F17/30 Main classification G06F17/30
Agency Agent
Principal claim 1. A method executed at least in part in a computing device to automatically select matching records from data sources, the method comprising: identifying at least two datasets from the data sources; filtering the at least two datasets to determine the matching records; identifying candidate matching pairs from the matching records; computing a score for each of the candidate matching pairs; and identifying a most likely match based on the score.
Address Redmond WA US
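
Illustrative sketch (not part of the patent record). The steps named in the abstract and principal claim (pre-filter on predefined attributes, form candidate pairs, score each pair, present in descending score order, pick a best match) could look roughly like the Python below. The record does not disclose the scoring algorithm or the filter attributes, so everything specific here is an assumption: the function names `prefilter`, `score_pair`, and `rank_matches` are hypothetical, the filter attribute `city` and the sample records are invented for illustration, and the score is a plain string similarity from the standard library's `difflib.SequenceMatcher`, swapped in as a stand-in for whatever scoring the patent actually uses.

```python
from difflib import SequenceMatcher
from itertools import product


def prefilter(records, attribute, allowed_values):
    """Keep only records whose `attribute` value is in `allowed_values`."""
    return [r for r in records if r.get(attribute) in allowed_values]


def score_pair(a, b, fields):
    """Average string similarity (0.0 to 1.0) over the compared fields.

    Assumption: SequenceMatcher is a stand-in for the undisclosed scoring.
    """
    ratios = [
        SequenceMatcher(None, str(a[f]).lower(), str(b[f]).lower()).ratio()
        for f in fields
    ]
    return sum(ratios) / len(ratios)


def rank_matches(source_a, source_b, filter_attribute, fields):
    """Return (score, record_a, record_b) tuples in descending score order."""
    # Pre-filter: keep records whose filter value occurs in both sources.
    values_a = {r.get(filter_attribute) for r in source_a}
    values_b = {r.get(filter_attribute) for r in source_b}
    shared = (values_a & values_b) - {None}
    filtered_a = prefilter(source_a, filter_attribute, shared)
    filtered_b = prefilter(source_b, filter_attribute, shared)

    # Candidate pairs: records that agree on the filter attribute.
    candidates = [
        (a, b)
        for a, b in product(filtered_a, filtered_b)
        if a[filter_attribute] == b[filter_attribute]
    ]

    # Score every candidate pair and sort with the highest score first.
    scored = [(score_pair(a, b, fields), a, b) for a, b in candidates]
    scored.sort(key=lambda item: item[0], reverse=True)
    return scored


# Hypothetical sample data from two sources.
crm = [
    {"id": 1, "name": "Contoso Ltd", "city": "Redmond"},
    {"id": 2, "name": "Fabrikam Inc", "city": "Seattle"},
    {"id": 3, "name": "Tailspin Toys", "city": "Boston"},
]
erp = [
    {"id": "A", "name": "Contoso Limited", "city": "Redmond"},
    {"id": "B", "name": "Northwind Traders", "city": "Redmond"},
    {"id": "C", "name": "Fabrikam", "city": "Seattle"},
]

ranked = rank_matches(crm, erp, filter_attribute="city", fields=["name"])
best = ranked[0] if ranked else None  # most likely match, if any candidates exist

for score, a, b in ranked:
    print(f"{score:.2f}  {a['name']!r} <-> {b['name']!r}")
```

In this sketch the pre-filter doubles as a blocking step: the Boston record never reaches scoring because no record in the other source shares its city, which keeps the number of scored pairs well below the full cross product. A real system would likely combine several attributes and a threshold before accepting `best` as a match.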