Title of invention: SCALABLE AUTOMATIC DATA REPAIR
Abstract: A computer-implemented method for generating a set of updates for a database comprising multiple records that include erroneous, missing and inconsistent values, the method comprising: using a set of partitioning functions to subdivide the records of the database into multiple subsets of records; allocating respective ones of the records to at least one subset according to predetermined criteria for mapping records to subsets; applying multiple machine learning models to each of the subsets to determine respective candidate replacement values representing a tuple repair for a record, including a probability of the candidate and current values for the record; and computing probabilities to select, from among the candidate replacement values, replacement values for the record which maximise the probability of the values of the record in an updated database.
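The steps named in the abstract (partition, allocate, model per subset, select the most probable value) can be illustrated with a minimal Python sketch. This is not the patented implementation: the toy records, the two partitioning functions, the frequency-count stand-in for a machine learning model, and the averaging rule used to combine probabilities are all assumptions introduced here for illustration.

```python
from collections import Counter, defaultdict

# Hypothetical toy records; None marks a missing value.
RECORDS = [
    {"id": 1, "zip": "47906", "city": "West Lafayette", "state": "IN"},
    {"id": 2, "zip": "47906", "city": "West Lafayette", "state": "IN"},
    {"id": 3, "zip": "47906", "city": "Lafayette", "state": "IN"},
    {"id": 4, "zip": "47906", "city": None, "state": "IN"},
    {"id": 5, "zip": "10001", "city": "New York", "state": "NY"},
    {"id": 6, "zip": "10001", "city": "New York", "state": "NY"},
    {"id": 7, "zip": "10001", "city": "Nwe York", "state": "NY"},
]

# Assumed partitioning functions: each maps a record to a subset key.
PARTITIONERS = {
    "by_zip": lambda r: r["zip"],
    "by_state": lambda r: r["state"],
}


def partition(records, key_fn):
    """Allocate every record to the subset named by key_fn(record)."""
    subsets = defaultdict(list)
    for r in records:
        subsets[key_fn(r)].append(r)
    return subsets


def candidate_probabilities(subset, attr):
    """Stand-in 'model': relative frequency of each observed value in the subset."""
    counts = Counter(r[attr] for r in subset if r[attr] is not None)
    total = sum(counts.values())
    return {v: c / total for v, c in counts.items()} if total else {}


def suggest_updates(records, attr, partitioners=PARTITIONERS):
    """Return {record id: (current value, suggested value, score)} for changed records."""
    # votes[id][value] collects one probability per partitioning that proposed the value.
    votes = {r["id"]: defaultdict(list) for r in records}
    for key_fn in partitioners.values():
        for subset in partition(records, key_fn).values():
            probs = candidate_probabilities(subset, attr)
            for r in subset:
                for value, p in probs.items():
                    votes[r["id"]][value].append(p)

    updates = {}
    n_models = len(partitioners)
    for r in records:
        # Assumed combination rule: average over all partitionings,
        # counting 0 for any partitioning that never proposed the value.
        scores = {v: sum(ps) / n_models for v, ps in votes[r["id"]].items()}
        if not scores:
            continue
        best = max(scores, key=scores.get)
        if best != r[attr]:
            updates[r["id"]] = (r[attr], best, scores[best])
    return updates


if __name__ == "__main__":
    for rec_id, (old, new, score) in sorted(suggest_updates(RECORDS, "city").items()):
        print(rec_id, repr(old), "->", repr(new), round(score, 3))
```

On this toy data the sketch proposes updates for records 3, 4 and 7 only. Record 3 shows the limit of a pure frequency vote: "Lafayette" may be a legitimate value, which is why a real system would weigh the probability of the current value using learned models rather than simple counts.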
Publication number: WO2012160171(A2)  Publication date: 2012.11.29
Application number: WO2012EP59772  Filing date: 2012.05.24
Applicant: QATAR FOUNDATION; YAKOUT, MOHAMED; ELMAGARMID, AHMED K.; BERTI-EQUILLE, LAURE; HOARTON, LLOYD  Inventor: YAKOUT, MOHAMED; ELMAGARMID, AHMED K.; BERTI-EQUILLE, LAURE
Classification: G06F17/30  Main classification: G06F17/30
Agency:  Agent:
Main claim:
Address: