发明名称 System and multi-thread method to manage a fault tolerant computer switching cluster using a spanning tree
摘要 A system, method and computer program to detect and recover from a communications failure in a computer network. The computer network has several nodes which include processor-based systems, input/output controllers and network controllers. Each node has a cluster adapter connected to multiple port switches through communications links. Data is transmitted through among the nodes through the communications links in the form of packets. A fabric manager module will monitor the network and detect a link failure. Upon the detection of a link failure between two switches a spanning tree partitioning module will partition the network into two trees at the point of the link failure. Thereafter, a link and switch identification module will identify a link between the two trees that can replace the failed link and has the least impact on the network. A routing table calculation algorithm module will calculate a new routing and distance table based on the identified link. The fabric manager module will then download the routing and distance table to only those switches effected by the new link selected to replace the failed link. This identification and recovery from communications link failures may be done with little overhead and without taking the network offline.
申请公布号 US6757242(B1) 申请公布日期 2004.06.29
申请号 US20000538264 申请日期 2000.03.30
申请人 INTEL CORPORATION 发明人 WANG JENLONG;YANG HUNGJEN (SEAN);SCHLOBOHM BRUCE;SWORTWOOD WILLIAM H.
分类号 H04L12/56;H04L29/06;H04L29/08;(IPC1-7):G01R31/08 主分类号 H04L12/56
代理机构 代理人
主权项
地址