发明名称 System and process for record duplication analysis
摘要 A system and process for record duplication analysis that relies on a multi-membership Bayesian analysis to determine the probability that records within a data set are matches. The Bayesian calculation may rely on objective data describing the data set as well as subjective assessments of the data set. In addition, a system and process for record duplication analysis may rely on the predetermination of probabilistic patterns, where the system only searches for patterns exceeding a chosen threshold. Work flow may include selecting which fields within each record should be analyzed, normalizing the values within those fields and removing default data, calculating possible patterns and their match probabilities, analyzing record pairs to determine which have patterns exceeding a chosen threshold to determine the presence of duplicates, and merging duplicates, closing transactions reflecting non-duplicates, identifying records having insufficient data to determine the existence or lack of a match, and/or rolling back accidental merges.
申请公布号 US8554742(B2) 申请公布日期 2013.10.08
申请号 US20090498186 申请日期 2009.07.06
申请人 NAEYMI-RAD FRANK;CHARLOT REGIS;HAINES DAVID;CARDWELL MATTHEW C;DECARO MICHAEL;INTELLIGENT MEDICAL OBJECTS, INC. 发明人 NAEYMI-RAD FRANK;CHARLOT REGIS;HAINES DAVID;CARDWELL MATTHEW C;DECARO MICHAEL
分类号 G06F7/00 主分类号 G06F7/00
代理机构 代理人
主权项
地址