发明名称 Method and system for implementing interconnection fault tolerance between CPU
摘要 A system for implementing interconnection fault tolerance between CPUs, a first CPU and a second CPU implements interconnection through a first CPU interconnect device and a second CPU interconnect device. The system adds a data channel between a first SerDes interface of the first CPU interconnect device and a second SerDes interface of the second CPU interconnect device, and transmits link connection state information and a link control signal through the added data channel. The system monitors a link state of any one link in a CPU interconnection system, transmits the link state through the added data channel, recovers any one of the connection links when determining whether any one of the first connection link, the second connection link and the third connection link is faulty.
申请公布号 US8909979(B2) 申请公布日期 2014.12.09
申请号 US201213707188 申请日期 2012.12.06
申请人 Huawei Technologies Co., Ltd. 发明人 Chang Sheng;Wang Haibin;Zhang Jie;Yang Rongyu;Hou Xinyu
分类号 G06F11/00;G06F11/20;G06F11/07;G06F11/14 主分类号 G06F11/00
代理机构 Huawei Technologies Co., Ltd. 代理人 Huawei Technologies Co., Ltd.
主权项 1. A computer system for implementing fault tolerance, comprising: a first CPU; a second CPU; a first CPU interconnect device that comprises a first quick path interconnect (QPI) interface and a first serial deserial (SerDes) interface that are connected to each other; and a second CPU interconnect device that comprises a second QPI interface and a second SerDes interface that are connected to each other; wherein: the first CPU and the second CPU implement interconnection through a first connection link, a second connection link and a third connection link;the first connection link is established between a QPI interface of the first CPU and the first QPI interface to transmit data sent or received by the first CPU;the third connection link is established between a QPI interface of the second CPU and the second QPI interface to transmit data sent or received by the second CPU;the second connection link is established between the first SerDes interface and the second SerDes interface to transmit data between the first CPU and the second CPU;a fourth data channel is additionally established between the first SerDes interface and the second SerDes interface, the fourth data channel is configured to transmit a link state and a link control signal; wherein the computer system is configured to monitor the link state of any one of the first connection link, the second connection link and the third connection link, transmit the link state through the fourth data channel between the first CPU interconnect device and the second CPU interconnect device, recover any one of the connection links when determining whether any one of the first connection link, the second connection link and the third connection link is faulty.
地址 Shenzhen CN