发明名称 NODE CONTROLLER FIRST FAILURE ERROR MANAGEMENT FOR A DISTRIBUTED SYSTEM
摘要 A distributed system provides error handling wherein the system includes multiple nodes, each node being coupled to multiple node controllers for control redundancy. Multiple system controllers couple to the node controllers via a network bus. A particular node controller may detect an error of that particular node controller. The particular node controller may store error information relating to the detected error in respective nonvolatile memory stores in the system controllers and node controllers according to a particular priority order. In accordance with the particular priority order, for example, the particular node controller may first attempt to store the error information to a primary system controller memory store, then to a secondary system controller memory store, and then to sibling and non-sibling node controller memory stores. The primary system controller organizes available error information for use by system administrators and other resources of the distributed system.
申请公布号 US2011276822(A1) 申请公布日期 2011.11.10
申请号 US20100775195 申请日期 2010.05.06
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 ABDUL ANIS M.;MAHAJAN AJAY K.;PIETRANIEC NICHOLAS A.;MA ANDREA Y.
分类号 G06F11/20;G06F11/00 主分类号 G06F11/20
代理机构 代理人
主权项
地址