Title: Adaptive warehouse data validation tool
Abstract: Techniques for data validation may include dynamically generating one or more database queries to be performed on a target data warehouse and a baseline data warehouse based on warehouse model metadata for the target data warehouse and the baseline data warehouse. The techniques may further include executing the one or more database queries against the target data warehouse and the baseline data warehouse to receive one or more data sets from the baseline data warehouse and one or more data sets from the target data warehouse. The techniques may further include comparing the one or more data sets from the baseline data warehouse and the one or more data sets from the target data warehouse to validate target data in the target data warehouse against baseline data in the baseline data warehouse.
Publication number: US9563679(B2) Publication date: 2017.02.07
Application number: US201414476421 Filing date: 2014.09.03
Applicant: International Business Machines Corporation Inventor: Seto Harold
Classification: G06F17/30 Main classification: G06F17/30
Agency: Shumaker & Sieffert, P.A. Agent: Shumaker & Sieffert, P.A.
Principal claim: 1. A method for validating data in a data warehouse, the method comprising: dynamically generating, by at least one processor, one or more database queries to be performed on a target data warehouse and a baseline data warehouse based on warehouse model metadata for the target data warehouse and the baseline data warehouse, including: generating one or more queries against the warehouse model metadata, executing the one or more queries against the warehouse model metadata to extract, from the warehouse model metadata, information regarding a warehouse object, the extracted information indicating one or more dimension tables referenced by a fact table in the warehouse object, and dynamically generating the one or more database queries based at least in part on the extracted information; executing, by the at least one processor, the one or more database queries against the target data warehouse and the baseline data warehouse to receive one or more data sets from the baseline data warehouse and one or more data sets from the target data warehouse; and comparing, by the at least one processor, the one or more data sets from the baseline data warehouse and the one or more data sets from the target data warehouse to validate target data in the target data warehouse against baseline data in the baseline data warehouse.
Address: Armonk NY US
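
The steps recited in the claim (query the model metadata, generate validation queries from the extracted fact-to-dimension references, run them against both warehouses, compare the results) can be illustrated with a minimal Python sketch. It is not the patented implementation. The metadata table `fact_dimension_refs`, the tables `fact_sales` and `dim_product`, and every function name below are hypothetical, and in-memory SQLite databases stand in for the baseline and target warehouses.

```python
import sqlite3


def referenced_dimensions(meta, fact_table):
    """Query the (hypothetical) metadata table for the dimension tables
    that the given fact table references."""
    return meta.execute(
        "SELECT fact_key, dim_table, dim_key, dim_label "
        "FROM fact_dimension_refs WHERE fact_table = ? ORDER BY dim_table",
        (fact_table,),
    ).fetchall()


def build_validation_query(fact_table, measure, dims):
    """Generate one aggregate query from the extracted metadata.
    Identifiers are interpolated directly, so this assumes trusted metadata."""
    if not dims:
        raise ValueError(f"no dimension references found for {fact_table}")
    cols = ", ".join(f"{dim_table}.{label}" for _, dim_table, _, label in dims)
    joins = " ".join(
        f"JOIN {dim_table} ON {fact_table}.{fact_key} = {dim_table}.{dim_key}"
        for fact_key, dim_table, dim_key, _ in dims
    )
    return (
        f"SELECT {cols}, SUM({fact_table}.{measure}) "
        f"FROM {fact_table} {joins} GROUP BY {cols} ORDER BY {cols}"
    )


def validate(baseline, target, query):
    """Run the same query on both warehouses and compare the data sets."""
    base_rows = set(baseline.execute(query).fetchall())
    target_rows = set(target.execute(query).fetchall())
    return {
        "missing_in_target": sorted(base_rows - target_rows),
        "unexpected_in_target": sorted(target_rows - base_rows),
    }


def make_metadata():
    """Demo fixture: metadata saying fact_sales references dim_product."""
    meta = sqlite3.connect(":memory:")
    meta.execute(
        "CREATE TABLE fact_dimension_refs "
        "(fact_table TEXT, fact_key TEXT, dim_table TEXT, dim_key TEXT, dim_label TEXT)"
    )
    meta.execute(
        "INSERT INTO fact_dimension_refs VALUES "
        "('fact_sales', 'product_id', 'dim_product', 'id', 'name')"
    )
    return meta


def make_warehouse(sales_rows):
    """Demo fixture: a tiny star schema with one fact and one dimension table."""
    db = sqlite3.connect(":memory:")
    db.execute("CREATE TABLE dim_product (id INTEGER PRIMARY KEY, name TEXT)")
    db.execute("CREATE TABLE fact_sales (product_id INTEGER, amount REAL)")
    db.executemany(
        "INSERT INTO dim_product VALUES (?, ?)", [(1, "widget"), (2, "gadget")]
    )
    db.executemany("INSERT INTO fact_sales VALUES (?, ?)", sales_rows)
    return db


if __name__ == "__main__":
    meta = make_metadata()
    baseline = make_warehouse([(1, 10.0), (1, 5.0), (2, 7.0)])
    target = make_warehouse([(1, 10.0), (1, 5.0), (2, 8.0)])  # one row differs
    dims = referenced_dimensions(meta, "fact_sales")
    query = build_validation_query("fact_sales", "amount", dims)
    print(query)
    print(validate(baseline, target, query))
```

Comparing aggregated result sets rather than raw rows is only one possible choice here; it keeps the example short and makes the comparison independent of row order in the fact table.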