发明名称 |
Node death detection by querying |
摘要 |
Systems, methods, and other embodiments associated with detecting a node death in a clustered distributed system are described. In one embodiment, a method includes transmitting a ping message to a peer node in the network. If a reply to the ping message is not received from the peer node, a query is sent to table of port identifiers that lists ports in the cluster. In one embodiment, the query includes a port identifier associated with the peer node. The peer node is declared as inactive/dead when the query fails to locate a match in the table for the port identifier. When the query locates a match in the table for the port identifier, another ping message is periodically transmitted to the peer node. |
申请公布号 |
US9135097(B2) |
申请公布日期 |
2015.09.15 |
申请号 |
US201213431196 |
申请日期 |
2012.03.27 |
申请人 |
ORACLE INTERNATIONAL CORPORATION |
发明人 |
Sridharan Vijay;Nguyen Huy;Shi Jia;Tsukerman Alex;Umamageswaran Kothanda |
分类号 |
G06F11/00;G06F11/07;H04L12/26 |
主分类号 |
G06F11/00 |
代理机构 |
Kraguljac Law Group, LLC |
代理人 |
Kraguljac Law Group, LLC |
主权项 |
1. A non-transitory computer-readable medium storing computer-executable instructions that when executed by a computer in a cluster of nodes cause the computer to perform functions, the computer-executable instructions comprising instructions configured for:
transmitting ping messages to a peer node at periodic intervals; tracking a number of miss counts for the peer node, wherein a miss count is when the peer node does not send a reply to a ping message; when the miss count for the peer node triggers a suspicion threshold, querying a subnet manager of the cluster for a translation of a port identifier associated with the peer node; and determining that the peer node is dead when the translation from the subnet manager fails. |
地址 |
Redwood Shore CA US |