Title of invention RELIABLE FAULT RESOLUTION IN CLUSTER
Abstract PROBLEM TO BE SOLVED: To provide a method and a system for locating and resolving a fault in a cluster environment. SOLUTION: The cluster (100) comprises at least one multi-homed node (110) and at least one gateway (140) for each of the network interfaces (112, 114). Heartbeat messages are sent between a peer node and the gateway at a predetermined periodic interval. If a loss of heartbeat messages occurs for any node or gateway, an ICMP echo is issued to each node and gateway in the cluster on a per-network-interface basis. When the ICMP echoes are answered and neither a node loss nor a network loss is identified, an application-level ping is issued to determine whether the fault behind the missing heartbeat messages is a transient error condition or an application software fault. COPYRIGHT: (C)2005, JPO&NCIPI
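The escalation described in the abstract (heartbeat loss, then per-interface ICMP echo, then application-level ping) can be illustrated with a minimal sketch. It is not the patented implementation. The names `Fault`, `resolve_fault`, `echo_peer`, `echo_gateway` and `app_ping` are hypothetical, and the probes are passed in as callables so that no real network access is needed. The abstract does not say how a node loss is told apart from a network loss, nor which ping outcome maps to which fault, so the rules in the comments are assumptions.

```python
from enum import Enum
from typing import Callable, Iterable


class Fault(Enum):
    NODE_LOSS = "node loss"
    NETWORK_LOSS = "network loss"
    APPLICATION_FAULT = "application software fault"
    TRANSIENT = "transient error condition"


def resolve_fault(
    interfaces: Iterable[str],
    echo_peer: Callable[[str], bool],
    echo_gateway: Callable[[str], bool],
    app_ping: Callable[[], bool],
) -> Fault:
    """Classify a missed heartbeat for one peer node.

    echo_peer(iface) / echo_gateway(iface) report whether an ICMP echo
    sent over that interface was answered; app_ping() reports whether the
    application-level ping was answered. All three are hypothetical hooks.
    """
    ifaces = list(interfaces)
    peer_ok = {i: echo_peer(i) for i in ifaces}
    gateway_ok = {i: echo_gateway(i) for i in ifaces}

    if not any(peer_ok.values()):
        # Assumption: the peer is silent on every interface. If at least one
        # gateway still answers, the network path works, so blame the node.
        if any(gateway_ok.values()):
            return Fault.NODE_LOSS
        return Fault.NETWORK_LOSS

    if not all(peer_ok.values()) or not all(gateway_ok.values()):
        # Assumption: the peer is reachable on some interface but at least
        # one peer or gateway echo failed, so one network is at fault.
        return Fault.NETWORK_LOSS

    # Every ICMP echo was answered: fall through to the application level.
    # Assumption: an answered ping means the heartbeat loss was transient.
    return Fault.TRANSIENT if app_ping() else Fault.APPLICATION_FAULT
```

For example, with two interfaces where every ICMP echo is answered but the application-level ping is not, the sketch reports `Fault.APPLICATION_FAULT`. In a real deployment the three callables would wrap actual ICMP and application probes with timeouts.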
Publication number JP2005073277(A) Publication date 2005.03.17
Application number JP20040246154 Filing date 2004.08.26
Applicant INTERNATL BUSINESS MACH CORP <IBM> Inventors RAO SUDHIR;JACKSON BRUCE;DAVIS MARK;SRIDHARA SRIKANTH
Classification H04L12/26;G06F11/00;G06F11/07;H04L12/24;H04L12/28;H04L12/56;H04L12/66;H04L29/10;(IPC1-7):H04L12/26 Main classification H04L12/26
Agency Agent
Main claim
Address