发明名称 DATA QUALITY ANALYSIS
摘要 A method includes receiving information indicative of an output dataset generated by a data processing system; identifying, based on data lineage information relating to the output dataset, one or more upstream datasets on which the output dataset depends; analyzing one or more of the identified one or more upstream datasets on which the output dataset depends. The analyzing includes, for each particular upstream dataset of the one or more upstream datasets, applying one or more of: (i) a first rule indicative of an allowable deviation between a profile of the particular upstream dataset and a reference profile for the particular upstream dataset, and (ii) a second rule indicative of one or more allowable values or prohibited values for each of one or more data elements in the particular upstream dataset, and based on the results of applying the one or more rules, selecting one or more of the upstream datasets. The method includes outputting information associated with the selected one or more upstream datasets.
申请公布号 WO2016201176(A1) 申请公布日期 2016.12.15
申请号 WO2016US36813 申请日期 2016.06.10
申请人 AB INITIO TECHNOLOGY LLC 发明人 SPITZ, Chuck;GOULD, Joel
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址