Title of Invention |
Systems and methods for preventing split-brain scenarios in high-availability clusters |
Abstract |
A computer-implemented method for preventing split-brain scenarios in high-availability clusters may include (1) detecting, at a first node of a high-availability cluster, a partitioning event that isolates the first node from a second node of the high-availability cluster, (2) broadcasting, from a health-status server and after the partitioning event has occurred, a cluster-health message to the first node that includes at least a health status of the second node that is based on whether the health-status server received a node-health message from the second node, and (3) reacting, at the first node and based at least in part on whether the first node received the cluster-health message, to the partitioning event such that the partitioning event does not result in a split-brain scenario within the high-availability cluster. Various other methods, systems, and computer-readable media are also disclosed. |
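The server-side step in the abstract, deriving each node's health status from whether a node-health message was received, can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the class name `HealthStatusServer`, its method names, the dictionary message format, and the 3-second freshness window are all hypothetical and do not appear in the record.

```python
import time
from dataclasses import dataclass, field


@dataclass
class HealthStatusServer:
    """Hypothetical health-status server; all names here are illustrative."""

    # Assumed freshness window in seconds; the record specifies no value.
    timeout: float = 3.0
    # Maps node id -> time the last node-health message was received.
    last_seen: dict = field(default_factory=dict)

    def record_node_health(self, node_id, now=None):
        """Note that a node-health message arrived from node_id."""
        self.last_seen[node_id] = time.monotonic() if now is None else now

    def build_cluster_health(self, node_ids, now=None):
        """Build the cluster-health message to broadcast.

        A node is reported healthy only if a node-health message from it
        was received within the assumed freshness window.
        """
        now = time.monotonic() if now is None else now
        return {
            node_id: (
                node_id in self.last_seen
                and now - self.last_seen[node_id] <= self.timeout
            )
            for node_id in node_ids
        }
```

A caller would invoke `record_node_health` whenever a node-health message arrives and periodically broadcast the result of `build_cluster_health`. The optional `now` argument exists only so the sketch can be exercised with fixed timestamps.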
Publication Number |
US9450852(B1) |
Publication Date |
2016.09.20 |
Application Number |
US201414146804 |
Filing Date |
2014.01.03 |
Applicant |
Juniper Networks, Inc. |
Inventors |
Chen Jian;Gu Jianhua;Ng Anthony Shun Leung;Lin Kuen-Der Christopher;Tang Xiaochen |
Classification Codes |
G06F11/00;H04L12/26 |
Main Classification Code |
G06F11/00 |
Agency |
ALG Intellectual Property, LLC |
Agent |
ALG Intellectual Property, LLC |
Principal Claim |
1. A computer-implemented method comprising:
detecting, at an active node of a high-availability cluster, a partitioning event that isolates the active node from a standby node of the high-availability cluster;
after the partitioning event has occurred:
broadcasting, from a health-status server, a cluster-health message to at least the active node, wherein:
the health-status server is separate and distinct from the active node and the standby node;
the cluster-health message comprises at least a health status of the standby node;
the health status of the standby node is based at least in part on whether the health-status server received a node-health message from the standby node after the partitioning event occurred;
reacting, at the active node, to the partitioning event such that the partitioning event does not result in a split-brain scenario within the high-availability cluster by performing, based at least in part on whether the active node received the cluster-health message from the health-status server, at least one of:
yielding, at the active node and in response to not receiving the cluster-health message from the health-status server, at least one computing task assigned to the active node to the standby node;
continuing to perform, at the active node and in response to receiving the cluster-health message from the health-status server, the at least one computing task assigned to the active node. |
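The reaction step of the claim can be sketched as a small decision function on the active node. This is an illustrative reading of the claim language only: the names `Reaction` and `react_to_partition` are hypothetical, and modeling "did not receive the cluster-health message" as `None` is an assumption made for the sketch.

```python
from enum import Enum
from typing import Optional


class Reaction(Enum):
    """Hypothetical labels for the two reactions named in the claim."""

    YIELD_TASKS = "yield"        # hand the computing task to the standby node
    CONTINUE_TASKS = "continue"  # keep performing the computing task


def react_to_partition(cluster_health: Optional[dict]) -> Reaction:
    """Decide how the active node reacts after detecting a partitioning event.

    cluster_health is the message received from the health-status server,
    or None if no message arrived (an assumed representation).
    """
    if cluster_health is None:
        # No cluster-health message received: yield the task.
        return Reaction.YIELD_TASKS
    # Cluster-health message received: continue the task.
    return Reaction.CONTINUE_TASKS
```

For example, `react_to_partition(None)` returns `Reaction.YIELD_TASKS`, while `react_to_partition({"standby": False})` returns `Reaction.CONTINUE_TASKS`. The sketch branches only on receipt of the message, as the quoted claim does; any use of the standby node's reported health status would be a further assumption.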
Address |
Sunnyvale CA US |