发明名称 METHOD AND A SYSTEM FOR RELIABLY RESOLVING A FAILURE IN A CLUSTER, PARTICULARLY FOR SUITABLY OVERCOMING A FAILURE BY CHECKING A STARTING POINT OF THE FAILURE BY DETECTING AND SEPARATING THE FAILURE
摘要 PURPOSE: A method and system for reliably resolving a failure in a cluster are provided to reliably and effectively detect a failure from a highly used integrated architecture and solve the failure. CONSTITUTION: A heartbeat message is transmitted to a peer node to monitor detection of a failure(202). It is determined whether there is a loss of a heartbeat in one network interface(204). If there is a loss of a heartbeat, a node which has detected the loss of the heartbeat issues an ICMP(Internet Control Message Protocol) echo(206). It is checked whether at least one echo return has been received with respect to one network interface(208). If at least one echo return has been received, it means that a network path is being properly functioned(210). Echo responses received from a target node set with respect to network interfaces are compared to determine a network path with an optimum connectivity according to one network interface in a cluster(212). It is checked whether connectivity of a different network path has been improved(214). If connectivity of the different network path has been improved, a network path failure is overcome(216).
申请公布号 KR20050022329(A) 申请公布日期 2005.03.07
申请号 KR20040065873 申请日期 2004.08.20
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 DAVIS MARK;JACKSON BRUCE;RAO SUDHIR;SRIDHARA SRIKANATH
分类号 H04L12/26;G06F11/00;G06F11/07;H04L12/24;H04L12/28;H04L12/56;H04L12/66;H04L29/10;(IPC1-7):H04L12/66 主分类号 H04L12/26
代理机构 代理人
主权项
地址