发明名称 Memory error recovery method in a cluster computer and a cluster computer
摘要 In a cluster computer which connects a plurality of nodes that include memory and processors through an interconnection network, shutting down of a node due to an irrecoverable error that occurs in a common communication area is prevented, and the availability of the cluster computer is increased.A system control apparatus in each node sends a system error stop notification to the memory access origin when an irrecoverable error occurred during a memory access request generated in one node is sent to the node proper memory located on the same node step (S 17). However, a common communication area error notification is sent to the memory access origin (step S 10 and step S 18), when an irrecoverable error occurs when a memory access request generated in one node is sent to the common communication area of the memory located on another node and when a memory access request generated on one node is sent to the common communication area of the memory of the same node, and the node shut down is prevented.
申请公布号 US6782492(B1) 申请公布日期 2004.08.24
申请号 US19990433238 申请日期 1999.11.04
申请人 NEC CORPORATION 发明人 NAKASO HIROKO
分类号 G06F15/16;G06F11/00;G06F11/07;G06F15/177;(IPC1-7):G06F11/00 主分类号 G06F15/16
代理机构 代理人
主权项
地址