发明名称 A METHOD AND SYSTEM FOR INTEGRATING DATA INTO A DATABASE
摘要 A method and system for integrating data into a database (6) comprises storing data from a plurality of data sources (S1,Si). The system comprises a rule learning module (1 ) and a duplicate elimination module (2). The rule learning module (1 ) operates in an initial rule learning stage. The duplicate elimination module (2) then operates in a de-duplication stage using the learnt rules. The de-duplication rules use conditional probability to determine the probability of records in the data sources (S1,Si) being duplicates of one another. Duplicate records are integrated and stored in the integrated database (6).
申请公布号 WO2014012576(A1) 申请公布日期 2014.01.23
申请号 WO2012EP63930 申请日期 2012.07.16
申请人 QATAR FOUNDATION;BESKALES, GEORGE;KALDAS, IHAB FRANCIS ILYAS;HOARTON, LLOYD 发明人 BESKALES, GEORGE;KALDAS, IHAB FRANCIS ILYAS
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址