发明名称 COLLECTIVELY LOADING PROGRAMS IN A MULTIPLE PROGRAM MULTIPLE DATA ENVIRONMENT
摘要 Techniques are disclosed for loading programs efficiently in a parallel computing system. In one embodiment, nodes of the parallel computing system receive a load description file which indicates, for each program of a multiple program multiple data (MPMD) job, nodes which are to load the program. The nodes determine, using collective operations, a total number of programs to load and a number of programs to load in parallel. The nodes further generate a class route for each program to be loaded in parallel, where the class route generated for a particular program includes only those nodes on which the program needs to be loaded. For each class route, a node is selected using a collective operation to be a load leader which accesses a file system to load the program associated with a class route and broadcasts the program via the class route to other nodes which require the program.
申请公布号 US2014282599(A1) 申请公布日期 2014.09.18
申请号 US201313800948 申请日期 2013.03.13
申请人 International Business Machines Corporation 发明人 AHO Michael E.;ATTINELLA John E.;GOODING Thomas M.;MILLER Samuel J.
分类号 G06F9/48 主分类号 G06F9/48
代理机构 代理人
主权项 1. A computer program product for loading a plurality of programs associated with a computing job in a parallel computing system including multiple nodes, the computer program product comprising: a computer-readable storage medium having computer-readable program code embodied therewith, the computer-readable program code comprising: computer-readable program code configured to, for each program, generate a respective class route, the class route including a set of one or more compute nodes of the parallel computing system which are to execute the program as part of performing the computing job; andcomputer-readable program code configured to, for each class route: select one of the nodes of the class route to load one of the programs which corresponds to the class route,load, by the selected node, the one of the programs from a file system, andbroadcast, by the selected node, the one of the programs along the class route.
地址 Armonk NY US