Title of invention Method and architecture for automated fault diagnosis and correction in a computer system
Abstract A method, apparatus, and computer program product for diagnosing and resolving faults are disclosed. A disclosed fault management architecture includes a fault manager having diagnostic engines and fault correction agents. The diagnostic engines receive error information and identify associated fault possibilities. The fault possibility information is passed to fault correction agents, which diagnose and resolve the associated faults. The architecture uses logs to track the status of error information, the status of fault management exercises, and the fault status of system resources. Additionally, a soft error rate discriminator can be employed to track and resolve soft (correctable) errors in the system. The architecture is extensible, allowing additional diagnostic engines and agents to be plugged into the architecture without interrupting the normal operational flow of the computer system.
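The flow described in the abstract can be illustrated with a minimal sketch. It is not taken from the patent: the class and function names (FaultManager, SoftErrorRateDiscriminator, make_memory_engine, retire_page_agent), the event dictionary keys, the status labels "repaired" and "faulty", and the sliding-window threshold rule are all assumptions chosen for illustration.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class SoftErrorRateDiscriminator:
    """Hypothetical discriminator: fires when more than `limit` soft errors
    fall inside a sliding window of `window` seconds."""
    limit: int
    window: float
    _times: deque = field(default_factory=deque)

    def record(self, timestamp: float) -> bool:
        self._times.append(timestamp)
        # Discard events that have aged out of the window.
        while self._times and timestamp - self._times[0] > self.window:
            self._times.popleft()
        return len(self._times) > self.limit


class FaultManager:
    """Hypothetical fault manager with pluggable engines and agents."""

    def __init__(self):
        self.engines = []          # diagnostic engines: error -> list of faults
        self.agents = []           # correction agents: fault -> True if resolved
        self.error_log = []        # status of error information
        self.exercise_log = []     # status of fault management exercises
        self.resource_status = {}  # fault status of system resources

    def add_engine(self, engine):
        # Engines can be registered at any time, without stopping the manager.
        self.engines.append(engine)

    def add_agent(self, agent):
        self.agents.append(agent)

    def post_error(self, error):
        self.error_log.append(error)
        for engine in self.engines:
            for fault in engine(error):
                self.exercise_log.append(fault)
                resolved = any(agent(fault) for agent in self.agents)
                status = "repaired" if resolved else "faulty"
                self.resource_status[fault["resource"]] = status


def make_memory_engine(serd):
    """Engine that suspects a fault only once the soft error rate is too high."""
    def engine(error):
        if error["class"] != "mem.correctable":
            return []
        if serd.record(error["time"]):
            return [{"resource": error["resource"], "fault": "mem.page"}]
        return []
    return engine


def retire_page_agent(fault):
    # Stand-in for a real corrective action such as retiring a memory page.
    return fault["fault"] == "mem.page"


fm = FaultManager()
fm.add_engine(make_memory_engine(SoftErrorRateDiscriminator(limit=2, window=10.0)))
fm.add_agent(retire_page_agent)
for t in (0.0, 1.0, 2.0):
    fm.post_error({"class": "mem.correctable", "resource": "dimm0", "time": t})
print(fm.resource_status)
```

In this sketch the first two correctable errors are only logged; the third one exceeds the assumed limit within the window, so the engine reports a suspected fault and the agent marks the resource as repaired. Keeping engines and agents as plain callables registered at run time is one simple way to model the extensibility the abstract describes.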
Publication number US2005102567(A1) Publication date 2005.05.12
Application number US20030698989 Filing date 2003.10.31
Applicant MCGUIRE CYNTHIA A.;HALEY TIMOTHY P.;RUDOFF ANDREW M.;SHAPIRO MICHAEL W.;SIMMONS MATTHEW T. Inventor MCGUIRE CYNTHIA A.;HALEY TIMOTHY P.;RUDOFF ANDREW M.;SHAPIRO MICHAEL W.;SIMMONS MATTHEW T.
Classification G06F11/25;(IPC1-7):G06F11/00 Main classification G06F11/25
Agency Agent
Principal claim
Address