发明名称 Checkpointing for a hybrid computing node
摘要 According to an aspect, a method for checkpointing in a hybrid computing node includes executing a task in a processing accelerator of the hybrid computing node. A checkpoint is created in a local memory of the processing accelerator. The checkpoint includes state data to restart execution of the task in the processing accelerator upon a restart operation. Execution of the task is resumed in the processing accelerator after creating the checkpoint. The state data of the checkpoint are transferred from the processing accelerator to a main processor of the hybrid computing node while the processing accelerator is executing the task.
申请公布号 US9280383(B2) 申请公布日期 2016.03.08
申请号 US201414302921 申请日期 2014.06.12
申请人 International Business Machines Corporation 发明人 Cher Chen-Yong
分类号 G06F9/46;G06F9/48 主分类号 G06F9/46
代理机构 Cantor Colburn LLP 代理人 Cantor Colburn LLP ;Stock William
主权项 1. A method for checkpointing in a hybrid computing node, the method comprising: executing a task in a processing accelerator of the hybrid computing node; creating a checkpoint in a local memory of the processing accelerator, the checkpoint comprising state data to restart execution of the task in the processing accelerator upon a restart operation; resuming execution of the task in the processing accelerator after creating the checkpoint; and transferring the state data of the checkpoint from the processing accelerator to a main processor of the hybrid computing node while the processing accelerator is executing the task, wherein the transferring is performed asynchronously on a lower bandwidth interface with the main processor, and a higher speed interface is used to create the checkpoint.
地址 Armonk NY US