发明名称 Dynamic job relocation in a high performance computing system
摘要 A method and apparatus is described for dynamic relocation of a job executing on multiple nodes of a high performance computing (HPC) systems. The job is dynamically relocated when the messaging network is in a quiescent state. The messaging network is quiesced by signaling the job to suspend execution at a global collective operation of the job where the messaging of the job is known to be in a quiescent state. When all the nodes have reached the global collective operation and paused, the job is relocated and execution is resumed at the new location.
申请公布号 US8516487(B2) 申请公布日期 2013.08.20
申请号 US20100703922 申请日期 2010.02.11
申请人 FELTON MITCHELL DENNIS;LUCAS RAY LEROY;SOLIE KARL MICHAEL;INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 FELTON MITCHELL DENNIS;LUCAS RAY LEROY;SOLIE KARL MICHAEL
分类号 G06F9/46 主分类号 G06F9/46
代理机构 代理人
主权项
地址