发明名称 Preventing unnecessary data recovery
摘要 A method that prevents unnecessary data recovery includes receiving, at a data processing device, a status of a resource of a distributed system. When the status of the resource indicates a resource failure, the method includes executing instructions on the data processing device to determine whether the resource failure is correlated to any other resource failures within the distributed system. When the resource failure is correlated to other resource failures within the distributed system, the method includes delaying execution on the data processing device of a remedial action associated with the resource. However, when the resource failure is uncorrelated to other resource failures within the distributed system, the method includes initiating execution on the data processing device of the remedial action associated with the resource.
申请公布号 US9223644(B1) 申请公布日期 2015.12.29
申请号 US201414188965 申请日期 2014.02.25
申请人 Google Inc. 发明人 Schrock Christian Eric;Cypher Robert;Schirripa Steven Robert
分类号 G06F11/00;G06F11/07 主分类号 G06F11/00
代理机构 Honigman Miller Schwartz and Cohn LLP 代理人 Honigman Miller Schwartz and Cohn LLP
主权项 1. A method comprising: receiving, at a data processing device, a status of a resource of a distributed system; when the status of the resource indicates a resource failure, executing instructions on the data processing device to determine whether the resource failure is correlated to any other resource failures within the distributed system; when the resource failure is correlated to other resource failures within the distributed system, delaying execution on the data processing device of a remedial action associated with the resource; when the resource failure is uncorrelated to other resource failures within the distributed system, initiating execution on the data processing device of the remedial action associated with the resource; when the resource failure is correlated to other resource failures within the distributed system, executing the remedial action on the data processing device after a first threshold period of time; and when the resource failure is uncorrelated to other resource failures within the distributed system, executing the remedial action on the data processing device after a second threshold period of time; wherein the first threshold period of time is greater than the second threshold period of time.
地址 Mountain View CA US