Title Automated fault and recovery system
Abstract A mechanism is provided for handling incidents occurring in a managed environment. An incident is detected in a resource in the managed environment. A set of incident handling actions is identified based on incident handling rules for an incident type of the incident. From the set of incident handling actions, one incident handling action is identified to be executed based on a set of impact indicators associated with the set of incident handling rules. The identified incident handling action is then executed to address the failure of the resource.
Publication number US9058265(B2) Publication date 2015.06.16
Application number US201213710710 Filing date 2012.12.11
Applicant International Business Machines Corporation Inventors Behrendt Michael M.;Hosn Rafah A.;Mahindru Ruchi;Ramasamy HariGovind V.;Sarkar Soumitra;Viswanathan Mahesh;Vogl Norbert G.
Classification G06F11/00;G06F11/07 Main classification G06F11/00
Agency Agent Lammas Francis;Walder, Jr. Stephen J.;Stock William J.
Principal claim 1. A method, in a data processing system, for handling incidents occurring in a managed environment, the method comprising: detecting, by a processor, an incident in a resource in the managed environment; identifying, by the processor, a set of incident handling actions based on incident handling rules for an incident type of the incident; from the set of incident handling actions, identifying, by the processor, one incident handling action to be executed based on a set of impact indicators associated with the set of incident handling rules, the set of impact indicators indicating an impact to one or more other resources within the managed environment and the one or more other resources being one or more of hardware, virtualization software, virtual machines, operating systems, middleware, or applications, wherein a value of an impact indicator in the set of impact indicators increases as either the number of tenants affected by the incident handling action with which the impact indicator is associated increases or the number of other applications affected by the incident handling action with which the impact indicator is associated increases; and executing, by the processor, the identified incident handling action to address the failure of the resource.
Address Armonk NY US
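
The selection step recited in the principal claim can be illustrated with a minimal Python sketch. It is not the patented implementation. The names `HandlingAction`, `RULES`, `select_action`, and `handle_incident`, the example incident type, and the tenant and application counts are all hypothetical. Two further assumptions are made that the claim does not state: the impact indicator is modeled as the sum of affected tenants and affected applications (the claim only requires that it increase with either count), and the action with the lowest indicator is the one chosen.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class HandlingAction:
    """A candidate incident handling action (hypothetical structure)."""
    name: str
    tenants_affected: int
    apps_affected: int
    run: Callable[[str], str]

    @property
    def impact(self) -> int:
        # Assumption: a simple sum. It grows when either count grows,
        # which is the only property the claim requires.
        return self.tenants_affected + self.apps_affected


# Hypothetical incident handling rules: incident type -> candidate actions.
RULES: Dict[str, List[HandlingAction]] = {
    "middleware_hang": [
        HandlingAction("reboot_host", 5, 12, lambda r: f"rebooted host of {r}"),
        HandlingAction("restart_middleware", 1, 2, lambda r: f"restarted {r}"),
    ],
}


def select_action(incident_type: str) -> HandlingAction:
    """Identify the candidate actions for the incident type and pick one."""
    candidates = RULES.get(incident_type, [])
    if not candidates:
        raise LookupError(f"no handling rule for incident type {incident_type!r}")
    # Assumption: prefer the action with the smallest impact indicator.
    return min(candidates, key=lambda action: action.impact)


def handle_incident(resource: str, incident_type: str) -> str:
    """Select an action for a detected incident and execute it on the resource."""
    action = select_action(incident_type)
    return action.run(resource)


if __name__ == "__main__":
    print(handle_incident("was-01", "middleware_hang"))
```

With these sample rules, restarting the middleware affects one tenant and two applications, so it is selected over rebooting the host, which affects five tenants and twelve applications.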