发明名称 Failover and recovery for replicated data instances
摘要 Replicated instances in a database environment provide for automatic failover and recovery. A monitoring component can periodically communicate with a primary and a secondary replica for an instance, with each capable of residing in a separate data zone or geographic location to provide a level of reliability and availability. A database running on the primary instance can have information synchronously replicated to the secondary replica at a block level, such that the primary and secondary replicas are in sync. In the event that the monitoring component is not able to communicate with one of the replicas, the monitoring component can attempt to determine whether those replicas can communicate with each other, as well as whether the replicas have the same data generation version. Depending on the state information, the monitoring component can automatically perform a recovery operation, such as to failover to the secondary replica or perform secondary replica recovery.
申请公布号 US9298728(B2) 申请公布日期 2016.03.29
申请号 US201314089616 申请日期 2013.11.25
申请人 Amazon Technologies, Inc. 发明人 McAlister Grant Alexander MacDonald;Sivasubramanian Swaminathan
分类号 G06F11/00;G06F17/30;G06F11/20;G06F11/14 主分类号 G06F11/00
代理机构 Meyertons, Hood, Kivlin, Kowert & Goetzel, P.C. 代理人 Kowert Robert C.;Meyertons, Hood, Kivlin, Kowert & Goetzel, P.C.
主权项 1. A computer-implemented method for managing a replicated database, comprising: under control of one or more computer systems configured with executable instructions, obtaining a generation identifier for a primary instance replica and a secondary instance replica of the replicated database upon initial pairing of the primary instance replica and the secondary instance replica, the primary instance replica and the secondary replica associated with a data environment;synchronizing data between the primary instance replica and the secondary instance replica using a block-level replication mechanism;periodically providing status information to a monitoring component of a control environment, the control environment being separate from the data environment;providing failure information to the monitoring component in response to the primary instance replica being unable to communicate with the secondary instance replica, the failure information including at least the generation identifier;determining that the primary instance replica is able to communicate with the monitoring component;obtaining a second generation identifier for the primary instance replica; andperforming one or more input/output (I/O) operations via the primary instance replica.
地址 Reno NV US