发明名称 DATA QUALITY ANALYSIS AND CLEANSING OF SOURCE DATA WITH RESPECT TO A TARGET SYSTEM
摘要 A system transfers data between source systems and a target system. The system determines a domain score for data domains of source data from the source systems based on data quality metrics for the target system. The domain score indicates data quality with respect to the target system. Corresponding processes of the target system are identified for the data domains, and a process score is determined for the identified processes based on a corresponding domain score. The process score indicates data quality with respect to the identified processes. The system cleanses the source data based on the domain score and/or process score, and validates the cleansed source data against the target system for transference. Embodiments of the present invention further include a method and computer program product for transferring data between source systems and a target system in substantially the same manner described above.
申请公布号 US2016070725(A1) 申请公布日期 2016.03.10
申请号 US201514692269 申请日期 2015.04.21
申请人 International Business Machines Corporation 发明人 Marrelli Carl M.;Narayanan Ram S.;Oberhofer Martin;Rashidi Solmaz
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A computer-implemented method of transferring data between one or more source systems and a target system comprising: determining a domain score for one or more data domains of source data from the one or more source systems based on one or more data quality metrics for the target system, wherein the domain score provides an indication of data quality of the source data with respect to the target system; identifying one or more corresponding processes of the target system for the one or more data domains and determining a process score for the one or more identified processes based on a corresponding domain score, wherein the process score indicates data quality of the source data with respect to the identified processes; cleansing the source data based on one or more from a group of the domain score and process score; and validating the cleansed source data against the target system for transference to the target system.
地址 Armonk NY US