发明名称 Recovery segments
摘要 In one example, a method for implementing recovery segments includes sending an application message from a parent process executed by a first computing device to a child process executed by a second computing device and identifying a dependency created by the application message. This identified dependency is included in a dependence set of the child process and saved. A checkpoint is generated by the parent process and a checkpoint message that includes dependency information is sent from the parent process to the child process. The child process modifies the dependence set according to the dependency information and generates a second checkpoint that is saved in nonvolatile memory of the second computing device. Upon occurrence of a failure of the parent process, the child process reverts to a most recent checkpoint generated by the child process that does not include the effects of processing an orphan message.
申请公布号 US8909987(B2) 申请公布日期 2014.12.09
申请号 US201213458680 申请日期 2012.04.27
申请人 Hewlett-Packard Development Company, L. P. 发明人 Karp Alan H.;Chakrabarti Dhruva;Golab Wojciech;Kelly Terence P.
分类号 G06F11/00 主分类号 G06F11/00
代理机构 代理人
主权项 1. A method for implementing recovery segments in a large scale computing application comprising: sending an application message from a parent process executed by a first computing device to a child process executed by a second computing device, in which the recovery segment comprises the parent process and the child process; identifying a dependency created by the application message; including the identified dependency in a dependence set of the child process and saving the dependence set in memory of the second computing device; generating, by the parent process, a first checkpoint and saving the first checkpoint in nonvolatile memory of the first computing device; sending, from the parent process to a child process, a checkpoint message that includes dependency information; receiving, by the child process, the checkpoint message and modifying the dependence set of the child process according to the dependency information; generating, by the child process, a second checkpoint and saving the second checkpoint in nonvolatile memory of the second computing device; upon occurrence of a failure of the parent process, reverting the child process to a most recent checkpoint generated by the child process that does not include effects of processing an orphan message.
地址 Houston TX US