Title of Invention: HEALTH MONITORING AND RECOVERY FOR INFRASTRUCTURE DEVICES
Abstract: Automated health monitoring and recovery is provided for infrastructure devices supporting server devices in a data center. Health analysis operations may be selected to be performed on an infrastructure device based on the capabilities of the infrastructure device and/or how the infrastructure device is being used to support server devices in the data center. If the infrastructure device is unhealthy, an automated recovery operation may be performed. The automated recovery operation may include recovery actions selected based on the capabilities of the infrastructure device, the failure mode of the infrastructure device, and/or how the infrastructure device is being used to support server devices in the data center.
Publication Number: US2015212901(A1)  Publication Date: 2015.07.30
Application Number: US201414164829  Filing Date: 2014.01.27
Applicant: MICROSOFT CORPORATION  Inventors: AGGARWAL CHANDAN;YAQOOB ASAD;MCKONE JOSH DAVID;EASON MATTHEW JEREMIAH;MERCHANT AKIL M.
Classification: G06F11/14;G06F11/34  Main Classification: G06F11/14
Agency:  Agent:
Principal Claim: 1. One or more computer storage media storing computer-usable instructions that, when used by one or more computing devices, cause the one or more computing devices to perform operations comprising:
triggering a health monitoring event for an infrastructure device supporting one or more server devices in a data center;
identifying device information for the infrastructure device;
determining an operational context of the infrastructure device in supporting the one or more server devices in the data center;
determining a health monitoring process for the infrastructure device based on the device information for the infrastructure device and the operational context of the infrastructure device in supporting the one or more server devices in the data center;
performing the determined health monitoring process for the infrastructure device to assess the health of the infrastructure device;
determining to perform an automated recovery operation for the infrastructure device based on the health of the infrastructure device;
in response to determining to perform the automated recovery operation for the infrastructure device, determining one or more recovery actions for the automated recovery operation based on the device information for the infrastructure device, the operational context of the infrastructure device in supporting the one or more server devices in the data center, and a failure context of the infrastructure device determined from the health monitoring process for the infrastructure device; and
performing at least a portion of the one or more recovery actions for the infrastructure device.
Address: Redmond WA US
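
The flow recited in the principal claim (select health checks from device information and operational context, run them, then choose recovery actions from device information, operational context, and failure context) can be illustrated with a minimal Python sketch. Nothing in it comes from the patent itself: the `Device` fields, the check names (`ping`, `snmp`, `power`), the action names (`migrate_load`, `power_cycle`, `restart_agent`, `open_ticket`), and the selection rules are all hypothetical placeholders chosen only to make the control flow concrete.

```python
from dataclasses import dataclass, field

@dataclass
class Device:
    """Hypothetical infrastructure device record (all fields are assumptions)."""
    name: str
    capabilities: set          # device information, e.g. {"ping", "snmp"}
    supported_servers: int     # operational context: servers relying on it
    readings: dict = field(default_factory=dict)  # simulated probe results

# Hypothetical check registry: check name -> function returning True on pass.
CHECKS = {
    "ping": lambda d: d.readings.get("ping", False),
    "snmp": lambda d: d.readings.get("snmp", False),
    "power": lambda d: d.readings.get("power", False),
}

def select_checks(device):
    """Pick checks from device information and operational context."""
    # Device information: only run checks the device is capable of.
    checks = [c for c in CHECKS if c in device.capabilities]
    # Operational context (assumed rule): a device supporting no servers
    # only gets the basic reachability check.
    if device.supported_servers == 0:
        checks = [c for c in checks if c == "ping"]
    return checks

def run_checks(device, checks):
    """Return the names of failed checks; this list is the failure context."""
    return [c for c in checks if not CHECKS[c](device)]

def select_recovery(device, failed):
    """Pick recovery actions from capabilities, usage, and failure context."""
    actions = []
    if "ping" in failed:
        # Device information: power-cycle only if remote power is available.
        can_cycle = "remote_power" in device.capabilities
        actions.append("power_cycle" if can_cycle else "open_ticket")
    if "snmp" in failed:
        actions.append("restart_agent")
    # Operational context (assumed rule): move load away before a
    # disruptive action when servers depend on the device.
    if device.supported_servers > 0 and "power_cycle" in actions:
        actions.insert(0, "migrate_load")
    return actions

def monitor(device):
    """Handle one health monitoring event for a device."""
    failed = run_checks(device, select_checks(device))
    if not failed:
        return {"healthy": True, "failed": [], "actions": []}
    return {"healthy": False, "failed": failed,
            "actions": select_recovery(device, failed)}

# Example: a switch supporting 40 servers that stops answering pings.
switch = Device("tor-switch-1", {"ping", "snmp", "remote_power"}, 40,
                {"ping": False, "snmp": True})
result = monitor(switch)
```

In this sketch the same failed check can lead to different actions depending on what the device can do and whether servers depend on it, which is the dependency the claim describes. A real system would execute the selected actions and might perform only a portion of them, as the final claim step allows.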