Title of invention Grouping Failure Events with Adaptive Polling and Sliding Window Buffering
Abstract Embodiments detect and group multiple failure events to enable batch processing of those failure events, such as in a virtual datacenter executing a plurality of virtual machines (VMs). A long timer, an adaptive short timer, and an adaptive polling frequency enable a computing device to efficiently detect and group failure events that may be related (e.g., resulting from one failure). The grouped failure events are processed in parallel, thereby reducing the time for recovery from the failure events.
Publication number US2017060660(A1) Publication date 2017.03.02
Application number US201615347536 Filing date 2016.11.09
Applicant VMware, Inc. Inventors Gondi Anjaneya Prasad; Kalluri Hemanth; Kalaskar Naveen Kumar
Classification G06F11/07; G06F11/20 Main classification G06F11/07
Agency Agent
Main claim 1. A system for failure event detection and grouping using adaptive polling intervals and sliding window buffering, said system comprising: a memory area associated with a computing device, said memory area storing a plurality of virtual machines (VMs) and datastores accessible thereto, a value for a short timer, and a value for a long timer; and a processor programmed to: upon detection of a failure event affecting at least one of the plurality of VMs and/or datastores accessible thereto, poll for additional failure events during each of a series of polling intervals until the short timer or the long timer expires, the polling during each of the series of polling intervals comprising: upon detection of at least one of the additional failure events, collecting data relating to the detected additional failure event, resetting the short timer, and reducing a duration of a next polling interval; and upon no detection of at least one of the additional failure events, increasing a duration of a next polling interval; group the detected failure event with the detected additional failure events; and perform recovery operations in parallel for each of the grouped failure events.
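Illustrative sketch (not part of the patent record): the polling loop recited in claim 1 might look roughly like the Python below. Every name here (FakeClock, group_failure_events, recover_in_parallel, make_scripted_poll), the default timer values, and the halve/double rule for the polling interval are assumptions made for illustration; the claim only requires that the next interval be reduced after a detection and increased otherwise. A simulated clock is used so the example runs instantly and deterministically.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class FakeClock:
    """Simulated clock (hypothetical helper); sleep() just advances time."""
    now: float = 0.0

    def sleep(self, seconds: float) -> None:
        self.now += seconds


def group_failure_events(
    first_event: str,
    poll: Callable[[float], List[str]],
    clock: FakeClock,
    short_timer: float = 10.0,   # assumed value, in seconds
    long_timer: float = 60.0,    # assumed value, in seconds
    initial_interval: float = 4.0,
    min_interval: float = 1.0,
    max_interval: float = 8.0,
) -> List[str]:
    """Collect events until the short timer or the long timer expires."""
    group = [first_event]
    short_deadline = clock.now + short_timer
    long_deadline = clock.now + long_timer
    interval = initial_interval
    while clock.now < short_deadline and clock.now < long_deadline:
        # Never wait past the nearest deadline.
        wait = min(interval, short_deadline - clock.now, long_deadline - clock.now)
        clock.sleep(wait)
        new_events = poll(clock.now)
        if new_events:
            group.extend(new_events)                     # collect event data
            short_deadline = clock.now + short_timer     # reset the short timer
            interval = max(min_interval, interval / 2)   # poll sooner next time
        else:
            interval = min(max_interval, interval * 2)   # back off
    return group


def recover_in_parallel(events: List[str], recover: Callable[[str], str]) -> List[str]:
    """Run the (hypothetical) recover callback for all grouped events concurrently."""
    with ThreadPoolExecutor(max_workers=len(events)) as pool:
        return list(pool.map(recover, events))


def make_scripted_poll(schedule: List[Tuple[float, str]]) -> Callable[[float], List[str]]:
    """Build a poll function that reports each scheduled event once it is due."""
    pending = sorted(schedule)

    def poll(now: float) -> List[str]:
        due = [name for t, name in pending if t <= now]
        pending[:] = [(t, name) for t, name in pending if t > now]
        return due

    return poll


if __name__ == "__main__":
    clock = FakeClock()
    poll = make_scripted_poll([(3.0, "vm-2 down"), (5.0, "datastore-1 inaccessible")])
    grouped = group_failure_events("vm-1 down", poll, clock)
    print(grouped, clock.now)
    print(recover_in_parallel(grouped, lambda e: "recovered: " + e))
```

In this sketch the short timer closes the group once no new event has arrived for short_timer seconds, while the long timer caps the total collection window even if events keep arriving.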
Address Palo Alto CA US