发明名称 |
ENHANCED RECOVERY OF HIGHLY AVAILABLE COMPUTING SYSTEMS |
摘要 |
Exemplary embodiments disclose a method and system for detecting a failure and resuming processing in a computing system encompassing at least two sites, a primary site and a secondary site. In a module, an exemplary embodiment generates a record of a logically consistent state and data of system components of the primary site periodically and transfers the record of a logically consistent state and data of system components of the primary site to the secondary site. In another module, an exemplary embodiment detects a failure in the primary site, halts the generation of the record of a logically consistent state and data of system components of the primary site periodically with a data freeze function, and resumes a processing of the primary site on the secondary site with secondary site components updated with a most recent logically consistent state and data of system components of the primary site. |
申请公布号 |
US2014173341(A1) |
申请公布日期 |
2014.06.19 |
申请号 |
US201213719426 |
申请日期 |
2012.12.19 |
申请人 |
INTERNATIONAL BUSINESS MACHINES CORPORATION |
发明人 |
Kern Robert F.;Petersen David B. |
分类号 |
G06F11/14 |
主分类号 |
G06F11/14 |
代理机构 |
|
代理人 |
|
主权项 |
1. A method for detecting a failure and resuming processing in a computing system encompassing at least two sites, a primary site and a secondary site, the method comprising:
generating a record of a logically consistent state and data of system components of the primary site periodically; transferring a record of a logically consistent state and data of system components of the primary site to the secondary site; updating a state and data of system components of the secondary site with the contents of a record of a logically consistent state and data of system components of the primary site; detecting a failure in the primary site; halting the generation of a record of a logically consistent state and data of system components of the primary site periodically with a data freeze function upon detecting a failure; and resuming a processing of the primary site on the secondary site with secondary site components updated with a most recent logically consistent state and data of system components of the primary site.
|
地址 |
Armonk NY US |