Title of invention: JOINT OPTIMIZATION OF MULTIPLE PHASES IN LARGE DATA PROCESSING
Abstract: Methods and arrangements for task scheduling. A plurality of jobs is received, each job comprising at least a map phase, a copy/shuffle phase and a reduce phase. For each job, a map phase execution time and a copy/shuffle phase execution time are determined. Each job is classified into at least one group based on at least one of: the determined map phase execution time and the determined copy/shuffle phase execution time. The plurality of jobs is executed via processor sharing, and the executing includes determining a similarity measure between jobs based on current job execution progress. Other variants and embodiments are broadly contemplated herein.
Publication number: US2014380320(A1) Publication date: 2014.12.25
Application number: US201313922746 Filing date: 2013.06.20
Applicant: International Business Machines Corporation Inventors: Lin Minghong; Tan Jian; Zhang Li
Classification: G06F9/48 Main classification: G06F9/48
Agency: Agent:
Main claim: 1. A method comprising: utilizing at least one processor to execute computer code configured to perform the steps of: receiving a plurality of jobs, each job comprising at least a map phase, a copy/shuffle phase and a reduce phase; determining, for each job, a map phase execution time and a copy/shuffle phase execution time; classifying each job into at least one group based on at least one of: the determined map phase execution time and the determined copy/shuffle phase execution time; and executing the plurality of jobs via processor sharing; said executing comprising determining a similarity measure between jobs based on current job execution progress.
Address: Armonk NY US
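
Illustrative sketch: The steps recited in the main claim (determine per-job map and copy/shuffle times, classify jobs into groups, execute under processor sharing, and compare jobs by execution progress) might be sketched in Python as follows. This is not the patented method. The record does not disclose the classification rule, the similarity measure, or how the similarity measure feeds back into scheduling, so the threshold rule in `classify`, the formula in `similarity`, the equal-split model in `processor_sharing_step`, and all names and numeric values here are assumptions made purely for illustration.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass
class Job:
    """Hypothetical job record; field names are illustrative assumptions."""
    name: str
    map_time: float      # determined map phase execution time
    shuffle_time: float  # determined copy/shuffle phase execution time
    reduce_time: float   # work attributed to the reduce phase
    done: float = 0.0    # work completed so far

    @property
    def total(self) -> float:
        return self.map_time + self.shuffle_time + self.reduce_time

    @property
    def progress(self) -> float:
        # Fraction of the job's total work completed, capped at 1.0.
        return min(self.done / self.total, 1.0)


def classify(job: Job, threshold: float = 10.0) -> str:
    """Assumed rule: bucket a job by whether each time exceeds a threshold."""
    m = "long-map" if job.map_time > threshold else "short-map"
    s = "long-shuffle" if job.shuffle_time > threshold else "short-shuffle"
    return f"{m}/{s}"


def similarity(a: Job, b: Job) -> float:
    """Assumed measure: 1.0 when progress is equal, lower as it diverges."""
    return 1.0 - abs(a.progress - b.progress)


def processor_sharing_step(jobs: list[Job], capacity: float = 1.0,
                           dt: float = 1.0) -> None:
    """Split the capacity equally among unfinished jobs for one time step."""
    active = [j for j in jobs if j.progress < 1.0]
    if not active:
        return
    share = capacity * dt / len(active)
    for j in active:
        j.done = min(j.done + share, j.total)


if __name__ == "__main__":
    jobs = [Job("a", 4, 2, 2), Job("b", 12, 15, 3), Job("c", 3, 20, 7)]
    for j in jobs:
        print(j.name, classify(j))
    while any(j.progress < 1.0 for j in jobs):
        processor_sharing_step(jobs, capacity=3.0)
        # Report pairwise similarity based on current execution progress.
        for a, b in combinations(jobs, 2):
            print(a.name, b.name, round(similarity(a, b), 3))
```

In this sketch the similarity values are only reported at each step. A real scheduler in the spirit of the claim would presumably use them to influence how capacity is shared, but the record gives no detail on that, so it is left out here.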