发明名称 Automated data analysis and transformation
摘要 A transformation method and system is provided. The method includes generating a data hub application configured to embed extract, transform, and load (ETL) processes. The data hub application is linked to source tables and target tables. Meta data associated with the source and target tables is transferred from virtual views of the data hub application to an ETL work area of the ETL processes. An ETL job is generated and linked to the data hub application. ETL processes are executed and results are determined.
申请公布号 US8768880(B2) 申请公布日期 2014.07.01
申请号 US201314041009 申请日期 2013.09.30
申请人 International Business Machines Corporation 发明人 Erla Arundhathi;Gupta Ritesh K.;Patil Madhusmita P.;Patil Swetha;Rajagopalan Ramesh;Thomas Bijo A.
分类号 G06F17/30 主分类号 G06F17/30
代理机构 Schmeiser, Olsen & Watts 代理人 Schmeiser, Olsen & Watts ;Pivnichny John
主权项 1. A method comprising: linking, by a computer processor of a data hub, source tables and target tables to a data hub application configured to embed, extract, transform, and load (ETL) processes; associating, by said computer processor, said source tables and said target tables to a local sensitive hashing (LSH) program comprising target flags; transferring, by said computer processor, metadata associated with said source tables and said target tables from virtual views of said data hub application to an ETL work area of said ETL processes, wherein said metadata comprises table definition metadata published to a DS tool comprising said source tables and said target tables; linking, by said computer processor, ETL job to said data hub application; executing, by said computer processor executing a data hub scheduler application, said ETL processes; determining, by said computer processor, results of said executing, wherein said results indicate that said executing was not successful; and detecting and syncing, by said computer processor, changes to said metadata associated with job failure sensing, wherein said detecting and syncing comprises: analyzing, by said computer processor, a log file pattern indicating a reason that said executing was not successful;decoding, by said computer processor, said log file pattern;generating, by said computer processor, a change script based on a category of said log file pattern, wherein said change script comprises an exception routine associated with non-classified categories;notifying, by said computer processor, users of said changes to said metadata and said change script; andtriggering, by said computer processor executing said change script, enabling updated changes to said metadata.
地址 Armonk NY US