发明名称 SYSTEM AND METHOD FOR AUTOMATIC WEIGHT GENERATION FOR PROBABILISTIC MATCHING
摘要 Embodiments of the invention provide a system and method of automatically generating weights for matching data records. Each field of a record may be compared by an exact match and/or close matches and each comparison can result in a mathematical score which is the sum of the field comparisons. To sum up the field scores accurately, the automatic weight generation process comprises an iterative process. In one embodiment, initial weights are computed based upon unmatched-set probabilities and default discrepancy weights associated with attributes in the comparison algorithm. A bulk cross-match is performed across the records using the initial weights and a candidate matched set is computed for updating the discrepancy probabilities. New weights are computed based upon the unmatched probabilities and the updated discrepancy probabilities. Test for convergence between the new weights and the old weights. Repeat with the new weight table until the weights converge to their final value.
申请公布号 US2010175024(A1) 申请公布日期 2010.07.08
申请号 US20100727975 申请日期 2010.03.19
申请人 SCHUMACHER SCOTT;ELLARD SCOTT;ADAMS NORMAN S 发明人 SCHUMACHER SCOTT;ELLARD SCOTT;ADAMS NORMAN S.
分类号 G06F17/30;G06F3/048 主分类号 G06F17/30
代理机构 代理人
主权项
地址