发明名称 |
Checkpointing for a hybrid computing node |
摘要 |
According to an aspect, a method for checkpointing in a hybrid computing node includes executing a task in a processing accelerator of the hybrid computing node. A checkpoint is created in a local memory of the processing accelerator. The checkpoint includes state data to restart execution of the task in the processing accelerator upon a restart operation. Execution of the task is resumed in the processing accelerator after creating the checkpoint. The state data of the checkpoint are transferred from the processing accelerator to a main processor of the hybrid computing node while the processing accelerator is executing the task. |
申请公布号 |
US9280383(B2) |
申请公布日期 |
2016.03.08 |
申请号 |
US201414302921 |
申请日期 |
2014.06.12 |
申请人 |
International Business Machines Corporation |
发明人 |
Cher Chen-Yong |
分类号 |
G06F9/46;G06F9/48 |
主分类号 |
G06F9/46 |
代理机构 |
Cantor Colburn LLP |
代理人 |
Cantor Colburn LLP ;Stock William |
主权项 |
1. A method for checkpointing in a hybrid computing node, the method comprising:
executing a task in a processing accelerator of the hybrid computing node; creating a checkpoint in a local memory of the processing accelerator, the checkpoint comprising state data to restart execution of the task in the processing accelerator upon a restart operation; resuming execution of the task in the processing accelerator after creating the checkpoint; and transferring the state data of the checkpoint from the processing accelerator to a main processor of the hybrid computing node while the processing accelerator is executing the task, wherein the transferring is performed asynchronously on a lower bandwidth interface with the main processor, and a higher speed interface is used to create the checkpoint. |
地址 |
Armonk NY US |