发明名称 Autonomic recovery from hardware errors in an input/output fabric
摘要 An apparatus, program product and method propagate errors detected in an IO fabric element from an IO fabric that is used to couple a plurality of endpoint IO resources to processing elements in a computer. In particular, such errors are propagated to the endpoint IO resources affected by the IO fabric element in connection with recovering from the errors in the IO fabric element. By doing so, a device driver or other program code used to access each affected IO resources may be permitted to asynchronously recover from the propagated error in its associated IO resource, and often without requiring the recovery from the error in the IO fabric element to wait for recovery to be completed for each of the affected IO resources. In addition, an IO fabric may be dynamically configured to support both recoverable and non-recoverable endpoint IO resources. In particular, IO fabric elements within an IO fabric may be dynamically configured to enable machine check signaling in such IO fabric elements in response to detection that an endpoint IO resource is non-recoverable in nature. The IO fabric elements that are dynamically configured as such are disposed within a hardware path that is defined between the non-recoverable resource and a processor that accesses the non-recoverable resource.
申请公布号 US2004230861(A1) 申请公布日期 2004.11.18
申请号 US20030438392 申请日期 2003.05.15
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 BAILEY DAVID ALAN;NGUYEN TRUNG NGOC;NORDSTROM GREGORY MICHAEL;PATEL KANISHA;THURBER STEVEN MARK
分类号 G06F9/46;G06F9/50;G06F11/00;H02H3/05;(IPC1-7):H02H3/05 主分类号 G06F9/46
代理机构 代理人
主权项
地址