发明名称 |
Identifying likely faulty components in a distributed system |
摘要 |
In general, techniques are described for automatically identifying likely faulty components in massively distributed complex systems. In some examples, snapshots of component parameters are automatically repeatedly fed to a pre-trained classifier and the classifier indicates whether each received snapshot is likely to belong to a fault and failure class or to a non-fault/failure class. Components whose snapshots indicate a high likelihood of fault or failure are investigated, restarted or taken off line as a pre-emptive measure. The techniques may be applied in a massively distributed complex system such as a data center. |
申请公布号 |
US9064216(B2) |
申请公布日期 |
2015.06.23 |
申请号 |
US201313842909 |
申请日期 |
2013.03.15 |
申请人 |
Juniper Networks, Inc. |
发明人 |
Reddy Rajeshekar;Nakil Harshad Bhaskar |
分类号 |
G06F17/00;G06N5/02;G06N99/00;H04L12/26;H04L12/24 |
主分类号 |
G06F17/00 |
代理机构 |
Shumaker & Sieffert, P.A. |
代理人 |
Shumaker & Sieffert, P.A. |
主权项 |
1. A method of predicting component failure, the method comprising:
receiving, by a communication protocol and with a virtual network controller that includes an analytics plane to analyze operations of a plurality of components in one or more virtual networks, a first parameter set from each of the components, wherein a parameter set from a component includes one or more quantitative parameters that each describes a state of the component; receiving, by the communication protocol and with the virtual network controller, an indication of detected component failure for one or more of the components; training, with the virtual network controller and using the first parameter sets and the indication of detected component failure, a trainable automated classifier to develop a classifying structure that distinguishes between component parameter sets that logically associate with a detected component failure and component parameter sets that do not logically associate with a detected component failure; receiving, by the communication protocol and with the virtual network controller, a second parameter set from each of the components; and predicting, with the virtual network controller using the trainable automated classifier and the classifying structure, a failure of a first one of the components. |
地址 |
Sunnyvale CA US |