发明名称 ENTITY RESOLUTION BETWEEN DATASETS
摘要 Embodiments relate to entity resolution. One aspect includes creating a deterministic model by defining an entity to be resolved, selecting two datasets for comparison, defining matching predicates for attributes of the datasets to select a set of candidate matches, and defining a precedence rule for the candidate matches to select a subset of the candidate matches. An aspect further includes running the deterministic model on the two datasets. Running the deterministic model includes applying the matching predicates and the precedence rule to data in the datasets that correspond to the attributes. An aspect also includes applying a cardinality rule to results of the running, and outputting the matching candidates for which the cardinality rule is satisfied.
申请公布号 US2016125067(A1) 申请公布日期 2016.05.05
申请号 US201414529585 申请日期 2014.10.31
申请人 International Business Machines Corporation 发明人 Alexe Bogdan;Burdick Douglas R.;Hernandez-Sherrington Mauricio A.;Karanam Hima P.;Krishnamurthy Rajasekar;Popa Lucian;Vaithyanathan Shivakumar
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A method for entity resolution, the method comprising: creating, via a computer processor, a deterministic model, the creating comprising: defining an entity to be resolved;selecting two datasets for comparison;defining matching predicates for attributes of the datasets to select a set of candidate matches; anddefining at least one precedence rule for the candidate matches to select a subset of the candidate matches; running, via the computer processor, the deterministic model on the two datasets, the running comprising applying the matching predicates and the precedence rule to data in the datasets that correspond to the attributes; applying a cardinality rule to results of the running; and outputting, via the computer processor, the matching candidates for which the cardinality rule is satisfied.
地址 Armonk NY US