发明名称 System and Method for Limiting the Impact of Stragglers in Large-Scale Parallel Data Processing
摘要 A large-scale data processing system and method including a plurality of processes, wherein a master process assigns input data blocks to respective map processes and partitions of intermediate data are assigned to respective reduce processes. In each of the plurality of map processes an application-independent map program retrieves a sequence of input data blocks assigned thereto by the master process and applies an application-specific map function to each input data block in the sequence to produce the intermediate data and stores the intermediate data in high speed memory of the interconnected processors. Each of the plurality of reduce processes receives a respective partition of the intermediate data from the high speed memory of the interconnected processors while the map processes continue to process input data blocks an application-specific reduce function is applied to the respective partition of the intermediate data to produce output values.
申请公布号 US2013332931(A1) 申请公布日期 2013.12.12
申请号 US201313965108 申请日期 2013.08.12
申请人 GOOGLE INC. 发明人 MALEWICZ GRZEGORZ;DVORSKY MARIAN;COLOHAN CHRISTOPHER B.;THOMSON DEREK P.;LEVENBERG JOSHUA LOUIS
分类号 G06F9/54 主分类号 G06F9/54
代理机构 代理人
主权项
地址