Title of invention |
Dynamic job relocation in a high performance computing system |
Abstract |
A method and apparatus are described for dynamically relocating a job executing on multiple nodes of a high performance computing (HPC) system. The job is relocated while the messaging network is in a quiescent state. The messaging network is quiesced by signaling the job to suspend execution at a global collective operation of the job, a point at which the job's messaging is known to be quiescent. Once all the nodes have reached the global collective operation and paused, the job is relocated and execution resumes at the new location.
|
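The quiesce-then-relocate sequence in the abstract can be illustrated with a small simulation. This is only a sketch under stated assumptions, not the patented implementation: threads stand in for compute nodes, Python's `threading.Barrier` stands in for the job's global collective operation, and the names `run_job`, `relocate_at_step`, `old_location` and `new_location` are hypothetical. The relocation request is raised by one simulated node for simplicity; in a real system it would presumably come from a separate control component.

```python
import threading


def run_job(num_nodes, steps, relocate_at_step, old_location, new_location):
    """Simulate a job that is relocated while all nodes pause at a collective.

    Returns a sorted list of (step, rank, location) tuples, recorded by each
    node right after it leaves the collective for that step.
    """
    location = {"current": old_location}        # where the job currently "runs"
    relocate_requested = threading.Event()      # hypothetical suspend signal
    log = []
    log_lock = threading.Lock()

    def relocate_if_requested():
        # Barrier action: runs in exactly one thread after every node has
        # arrived and before any is released, so no node is messaging.
        if relocate_requested.is_set():
            location["current"] = new_location  # stand-in for moving the job
            relocate_requested.clear()

    collective = threading.Barrier(num_nodes, action=relocate_if_requested)

    def node(rank):
        for step in range(steps):
            # Local computation and point-to-point messaging would go here.
            if rank == 0 and step == relocate_at_step:
                relocate_requested.set()        # signal the job to suspend
            collective.wait()                   # global collective operation
            with log_lock:
                log.append((step, rank, location["current"]))

    threads = [threading.Thread(target=node, args=(r,)) for r in range(num_nodes)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return sorted(log)
```

Calling `run_job(4, 3, 1, "partition-A", "partition-B")` yields twelve records: every node reports `partition-A` after step 0 and `partition-B` after steps 1 and 2. Doing the move inside the barrier action means it happens only once all nodes have arrived and none has resumed, which mirrors the condition the abstract describes.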
Publication number |
US8516487(B2) |
Publication date |
2013.08.20 |
Application number |
US20100703922 |
Filing date |
2010.02.11 |
Applicant |
FELTON MITCHELL DENNIS;LUCAS RAY LEROY;SOLIE KARL MICHAEL;INTERNATIONAL BUSINESS MACHINES CORPORATION |
Inventor |
FELTON MITCHELL DENNIS;LUCAS RAY LEROY;SOLIE KARL MICHAEL |
Classification number |
G06F9/46 |
Main classification number |
G06F9/46 |
Agency |
|
Agent |
|
Principal claim |
|
Address |
|