发明名称 PROACTIVE AND ADAPTIVE CLOUD MONITORING
摘要 Processes, computer-readable media, and machines are disclosed for reducing a likelihood that active functional components fail in a computing system. An active monitoring component receives metrics associated with different active functional components of a computing system. The different active functional components contribute to different functionalities of the system. Based at least in part on the metrics associated with a particular active functional component, the active monitoring component determines that the particular active functional component has reached a likelihood of failure but has not failed. In response to determining that the particular active functional component has reached the likelihood of failure but has not failed, the active monitoring component causes a set of actions that are predicted to reduce the likelihood of failure.
申请公布号 US2015095720(A1) 申请公布日期 2015.04.02
申请号 US201414556357 申请日期 2014.12.01
申请人 ORACLE INTERNATIONAL CORPORATION 发明人 Srivastava Deepti;Ingham Andrew;Hsu Cheng-Lu;Chan Wilson Wai Shun
分类号 G06F11/07 主分类号 G06F11/07
代理机构 代理人
主权项 1. A method comprising: determining that an active functional component of a computer system has reached a likelihood of failure based at least in part on a first value of a metric exceeding a metric threshold; wherein the first value of the metric reflects performance of the active functional component at a time before a first action is caused to be performed in an attempt to reduce the likelihood of failure of the active functional component; after causing the first action to be performed, determining that the active functional component has again reached the likelihood of failure based at least in part on a second value of the metric exceeding the metric threshold; wherein the second value of the metric reflects performance of the active functional component at a time after the first action is caused to be performed but before a second action, that is different from the first action, is caused to be performed in an attempt to reduce the likelihood of failure of the active functional component; after causing the second action to be performed, obtaining a third value for the metric reflecting performance of the active functional component at a time after the second action is caused to be performed; and using a machine learning component to determine a third action to perform to reduce the likelihood of failure of the active functional component based at least in part on the second and third values of the metric and the first and second actions caused to be performed; wherein the method is performed by one or more computing devices.
地址 Redwood Shores CA US