发明名称 Methods and systems for reconstructing the state of a computation
摘要 Methods and systems for running and checkpointing parallel and distributed applications which does not require modification to the programs used in the system nor changes to the underlying operating system. One embodiment of the invention includes the following general steps: (1) starting an application on a parallel processing system; (2) controlling processes for the application, including recording of commands and responses; (3) controlling a commit protocol; (4) detecting failures of the application; (5) continuing execution of the application from the most recently committed transaction after "replaying" the recorded commands and responses. A second embodiment comprises the following general steps: (1) starting an application on a parallel processing system; (2) controlling processes for the application, including recurrent recording of the memory image of a driver program that controls the application; (3) controlling a commit protocol; (4) detecting failures of the application; (5) continuing execution of the application from the most recently committed transaction after "restoring" the recorded memory image of the driver program.
申请公布号 US5712971(A) 申请公布日期 1998.01.27
申请号 US19950570724 申请日期 1995.12.11
申请人 AB INITIO SOFTWARE CORPORATION 发明人 STANFILL, CRAIG;LASSER, CLIFF;LORDI, ROBERT
分类号 G06F11/14;(IPC1-7):G06F11/08 主分类号 G06F11/14
代理机构 代理人
主权项
地址
您可能感兴趣的专利