Title of invention: TRANSFORMING AND LOADING DATA UTILIZING IN-MEMORY PROCESSING
Abstract: A system includes at least one processor and processes an ETL job. The system analyzes a specification of the ETL job including one or more functional expressions to load data from one or more source data stores, process the data in memory, and store the processed data to one or more target data stores. One or more data flows are produced from the specification based on the one or more functional expressions. The one or more data flows utilize in-memory distributed data sets generated to accommodate parallel processing for loading and processing the data. The one or more data flows are optimized to assign operations to be performed on the one or more source data stores. The optimized data flows are executed to load the data to the one or more target data stores in accordance with the specification. Present invention embodiments further include methods and computer program products.
Publication number: US2017075966(A1) Publication date: 2017.03.16
Application number: US201615227265 Filing date: 2016.08.03
Applicant: International Business Machines Corporation Inventors: Greene Lawrence A.; Li Yong; Pu Xiaoyan; Sheng Yeh-Heng
Classification: G06F17/30 Main classification: G06F17/30
Agency: Agent:
Main claim: 1. A method of processing an Extract, Transform, Load (ETL) job comprising: analyzing a specification of the ETL job including one or more functional expressions to load data from one or more source data stores, process the data in memory, and store the processed data to one or more target data stores; producing one or more data flows from the specification based on the one or more functional expressions, wherein the one or more data flows utilize in-memory distributed data sets generated to accommodate parallel processing for loading and processing the data; optimizing the one or more data flows to assign operations to be performed on the one or more source data stores; and executing the optimized data flows to load the data to the one or more target data stores in accordance with the specification.
Address: Armonk NY US
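
The four steps recited in the main claim (analyze the specification, produce a data flow, optimize it by assigning operations to the source data store, execute it) can be illustrated with a small sketch. This is not the patented implementation and does not use any real ETL or cluster engine: the names `Op`, `build_flow`, `optimize`, `partition`, and `execute` are hypothetical, the "data stores" are plain Python dictionaries, and the "in-memory distributed data set" is approximated by a list of partitions processed one after another, where a real engine would presumably process them in parallel.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional

# Hypothetical stand-in for a data store: table name -> list of row dicts.
Store = Dict[str, List[dict]]

@dataclass
class Op:
    kind: str                                         # "load", "filter", "map", "store"
    arg: object                                       # table name or a function
    pushed: Optional[Callable[[dict], bool]] = None   # predicate assigned to the source

def build_flow(spec: List[tuple]) -> List[Op]:
    """Analyze the specification and produce a linear data flow."""
    return [Op(kind, arg) for kind, arg in spec]

def optimize(flow: List[Op]) -> List[Op]:
    """Assign a filter that directly follows a load to the source store."""
    out: List[Op] = []
    for op in flow:
        follows_load = bool(out) and out[-1].kind == "load" and out[-1].pushed is None
        if op.kind == "filter" and follows_load:
            out[-1] = Op("load", out[-1].arg, pushed=op.arg)
        else:
            out.append(op)
    return out

def partition(rows: List[dict], n: int) -> List[List[dict]]:
    """Split rows into at most n contiguous partitions (order is preserved)."""
    size = max(1, -(-len(rows) // n))  # ceiling division
    return [rows[i:i + size] for i in range(0, len(rows), size)]

def execute(flow: List[Op], source: Store, target: Store, partitions: int = 2) -> None:
    """Run the flow; each partition is handled independently of the others."""
    parts: List[List[dict]] = []
    for op in flow:
        if op.kind == "load":
            rows = source[op.arg]
            if op.pushed is not None:
                # Assumption: this models work done by the source store itself.
                rows = [r for r in rows if op.pushed(r)]
            parts = partition(rows, partitions)
        elif op.kind == "filter":
            parts = [[r for r in p if op.arg(r)] for p in parts]
        elif op.kind == "map":
            parts = [[op.arg(r) for r in p] for p in parts]
        elif op.kind == "store":
            target[op.arg] = [r for p in parts for r in p]

# Illustrative specification written as functional expressions (made-up data).
source: Store = {"orders": [{"id": 1, "amount": 50},
                            {"id": 2, "amount": 150},
                            {"id": 3, "amount": 300}]}
target: Store = {}
spec = [
    ("load", "orders"),
    ("filter", lambda r: r["amount"] > 100),
    ("map", lambda r: {**r, "doubled": r["amount"] * 2}),
    ("store", "big_orders"),
]
flow = optimize(build_flow(spec))
execute(flow, source, target)
```

In this sketch the optimizer folds the filter into the load step, so only the matching rows ever reach the in-memory partitions; the map step then runs per partition before the result is written to the target store.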