发明名称 |
Telemetry data analysis using multivariate sequential probability ratio test |
摘要 |
One embodiment provides a system that analyzes telemetry data from a monitored system. During operation, the system periodically obtains the telemetry data as a set of telemetry variables from the monitored system and updates a multidimensional real-time distribution of the telemetry data using the obtained telemetry variables. Next, the system analyzes a statistical deviation of the multidimensional real-time distribution from a multidimensional reference distribution for the monitored system using a multivariate sequential probability ratio test (SPRT) and assesses the integrity of the monitored system based on the statistical deviation of the multidimensional real-time distribution. If the assessed integrity falls below a threshold, the system determines a fault in the monitored system corresponding to a source of the statistical deviation. |
申请公布号 |
US9152530(B2) |
申请公布日期 |
2015.10.06 |
申请号 |
US200912454226 |
申请日期 |
2009.05.14 |
申请人 |
ORACLE AMERICA, INC. |
发明人 |
Gross Kenny C.;Dhanekula Ramakrishna C.;Urmanov Aleksey M. |
分类号 |
G06F17/18;G06F11/34;G06F11/14;G06F11/30;G06F11/07 |
主分类号 |
G06F17/18 |
代理机构 |
Park, Vaughan, Fleming & Dowler LLP |
代理人 |
Park, Vaughan, Fleming & Dowler LLP ;Suen Chia-Hsin |
主权项 |
1. A computer-implemented method for analyzing telemetry data from a monitored system, comprising, in at least one computer, performing one or more operations for:
periodically obtaining the telemetry data as a set of telemetry variables from the monitored system; using the obtained telemetry variables to update a multidimensional real-time distribution of the telemetry data that represents a multidimensional distribution of pointwise differences in values of the telemetry variables observed at the monitored system and values of the telemetry variables observed at a known-good system, wherein a first dimension for the multidimensional real-time distribution corresponds to a first type of the telemetry variables, and wherein a second dimension for the multidimensional real-time distribution corresponds to a second type of the telemetry variables that is different from the first type; using a multivariate sequential probability ratio test (SPRT) to analyze a statistical deviation of a parameter of the multidimensional real-time distribution from a parameter of a multidimensional reference distribution for the monitored system that represents a multidimensional distribution of pointwise differences in values of the telemetry variables observed at a normally operating system and values of the telemetry variables observed at the known-good system, and wherein using the SPRT comprises determining a drift in values of a matrix for the parameter of the multidimensional real-time distribution from values of a matrix for the parameter of the multidimensional real-time distribution; assessing the integrity of the monitored system based on the drift, wherein assessing the integrity comprises, when the drift falls below a threshold, determining a fault in the monitored system corresponding to a source of the statistical deviation; determining a degrading field replaceable unit (FRU) of the monitored system that caused the statistical deviation by using at least some data in the multidimensional real-time distribution; and replacing the degrading FRU. |
地址 |
Redwood Shores CA US |