发明名称 SYSTEM AND METHOD OF MONITORING AND MEASURING CLUSTER PERFORMANCE HOSTED BY AN IAAS PROVIDER BY MEANS OF OUTLIER DETECTION
摘要 The present disclosure is directed to a system for monitoring and analyzing operation of a widely distributed service operated by an Infrastructure-as-a-Service (IaaS) tenant but deployed on a set of virtual resources controlled by an independent IaaS provider. The set of virtual resources can be organized into clusters in which resources are expected to behave similarly to each other. Virtual resources that do not behave similar to peer resources in the same cluster, i.e., outliers, may be indicative of problems that need to be addressed. The monitoring system can collect performance metric data from virtual resources, and compare the performance of each virtual resource in a cluster with the performance of every other virtual resource in the cluster to detect outliers. This comparison can involve correlation analysis, ANOVA analysis, or regression analysis.
申请公布号 US2015081881(A1) 申请公布日期 2015.03.19
申请号 US201314144980 申请日期 2013.12.31
申请人 Stackdriver, Inc. 发明人 EATON Patrick Randolph
分类号 H04L12/26 主分类号 H04L12/26
代理机构 代理人
主权项 1. A system for detecting outlier virtual resources that perform differently from peer virtual resources within a cluster of virtual resources that are expected to perform similarly, the virtual resources being provided by an independent Infrastructure-as-a-Service (IaaS) provider to an IaaS tenant for operating a widely distributed service, wherein the widely distributed service may be at least one of geographically dispersed, part of different communication networks, and disjoint, wherein the IaaS provider is responsible for selection of resources, wherein an operational capacity of the resources may change substantially and rapidly, and wherein the IaaS tenant has no direct control over, and limited visibility into, the selection of resources, the system comprising: a data gateway configured to collect substantially live system-level metrics related to the operation of the cluster of resources; and an analysis module configured to: compare the live system-level metrics for resources in the cluster of resources against live system-level metrics for other peer resources in the cluster of resources; anddetermine that a candidate resource is an outlier if a difference between the live system-level metrics for the candidate resource and the live system-level metrics for other peer resources in the cluster of resources exceeds a predetermined threshold, wherein the determination that the candidate resource is an outlier includes at least one of performing correlation analysis on the candidate resource, performing ANOVA analysis on the candidate resource, and performing regression analysis on the candidate resource.
地址 Boston MA US