发明名称 Configurable dynamic matching system
摘要 A system is provided that that dynamically matches data originating from one or more data sources. The system analyzes a matching configuration file, where the matching configuration file includes one or more matching configurations. The system modifies a probabilistic matching algorithm of a matching engine at runtime based on the one or more matching configurations and based on two or more data records of the plurality of data records that require matching. The system compares two data records of a plurality of data records using the modified probabilistic matching algorithm. The system generates a match score for the two data records based on the match weight for each data record field.
申请公布号 US8849837(B2) 申请公布日期 2014.09.30
申请号 US201213645591 申请日期 2012.10.05
申请人 Oracle International Corporation 发明人 Dua Swaranjit Singh
分类号 G06F17/30 主分类号 G06F17/30
代理机构 Miles & Stockbridge P.C. 代理人 Miles & Stockbridge P.C.
主权项 1. A non-transitory computer-readable medium having instructions stored thereon that, when executed by a processor, cause the processor to dynamically match data originating from one or more data sources, the matching comprising: receiving data from one or more data sources, wherein the data comprises a plurality of data records; analyzing a matching configuration file, wherein the matching configuration file comprises one or more matching configurations, wherein the one or more matching configurations comprise one or more match sets, and wherein each match set comprises one or more data record fields and one or more frequency-based fields; modifying a probabilistic matching algorithm of a matching engine at runtime based on the one or more matching configurations and based on two or more data records of the plurality of data records that require matching; comparing two data records of the plurality of data records using the modified probabilistic matching algorithm, wherein the comparing comprises comparing values of one or more data record fields of the two data records and generating a match weight for each data record field based on the comparison of values, and the one or more matching configurations; generating a match score for the two data records based on the match weight for each data record field; and modifying the match score for the two data records based on a frequency of a value of the one or more frequency-based fields.
地址 Redwood Shores CA US