发明名称 Critical systems inspector
摘要 Techniques are described for identifying a root cause of a pattern of performance data in a system including a plurality of services. Embodiments provide dependency information for each of the plurality of services, where at least one of the plurality of services is dependent upon a first one of the plurality of services. Each of the plurality of services is monitored to collect performance data for the respective service. Embodiments further analyze the performance data to identify a cluster of services that each follow a pattern of performance data. The first one of the services in the cluster of services is determined to be a root cause of the pattern of performance data, based on the determined dependency information for each of the plurality of services.
申请公布号 US9582395(B2) 申请公布日期 2017.02.28
申请号 US201313826942 申请日期 2013.03.14
申请人 NETFLIX, INC. 发明人 Tuffs Philip Simon;Rapoport Roy;Tseitlin Ariel
分类号 G06F11/00;G06F11/34;G06F11/07 主分类号 G06F11/00
代理机构 Artegis Law Group, LLP 代理人 Artegis Law Group, LLP
主权项 1. A method, comprising: monitoring a plurality of services; collecting performance data for each service included in the plurality of services; identifying, by operation of one or more computer processors, a cluster of related services by analyzing the performance data and determining that each service included in the cluster of related services exhibits a statistically similar pattern of performance data with respect to one or more performance metrics, wherein a first service is included in the cluster of related services, by: calculating a similarity value for at least one service included in the plurality of services that is indicative of a statistical similarity between the performance data of the service and at least one of the performance data of the first service, the performance data of a second service, and a predetermined statistical pattern, anddetermining that the similarity values calculated for each service included in the cluster of related services exceed a predetermined threshold amount of similarity; and determining, by operation of one or more computer processors, that the first service is a root cause of the pattern of performance data for each service included in the cluster of related services by determining that each service included in the cluster of related services, other than the first service, depends, either directly or indirectly, on the first service.
地址 Los Gatos CA US