发明名称 Adaptive parallel data processing
摘要 Described herein are methods, systems, apparatuses and products for adaptive parallel data processing. An aspect provides providing a map phase in which at least one map function is applied in parallel on different partitions of input data at different mappers in a parallel data processing system; providing a communication channel between mappers using a distributed meta-data store, wherein said map phase comprises mapper data processing adapted responsive to communication with said distributed meta-data store; and providing data accessible by at least one reduce phase node in which at least one reduce function is applied. Other embodiments are disclosed.
申请公布号 US8954967(B2) 申请公布日期 2015.02.10
申请号 US201113149312 申请日期 2011.05.31
申请人 International Business Machines Corporation 发明人 Balmin Andrey;Beyer Kevin Scott;Ercegovac Vuk;Vernica Rares
分类号 G06F9/46;G06F9/50 主分类号 G06F9/46
代理机构 Ference & Associates LLC 代理人 Ference & Associates LLC
主权项 1. A computer program product comprising: a non-transitory computer readable storage medium having computer readable program code embodied therewith, the computer readable program code comprising: computer readable program code configured to provide a map phase in which at least one map function is applied in parallel on different partitions of input data at different mappers in a parallel data processing system, and to perform the map phase; computer readable program code configured to provide a communication channel which enables communication between mappers using a distributed meta-data store; wherein to perform the map phase comprises: using the communication channel to permit each mapper to: post meta-data for use by at least one other mapper about its state; andreceive information regarding a state of at least one other mapper; andperforming mapper data processing which is adapted responsive in response to the communication which takes place between mappers via the distributed meta-data store; and computer readable program code configured to provide data accessible by at least one reduce phase node in which at least one reduce function is applied.
地址 Armonk NY US