发明名称 PROACTIVE FAILURE RECOVERY MODEL FOR DISTRIBUTED COMPUTING
摘要 This disclosure generally describes methods and systems, including computer-implemented methods, computer-program products, and computer systems, for providing a proactive failure recovery model for distributed computing. One computer-implemented method includes building a virtual tree-like computing structure of a plurality of computing nodes, for each computing node of the virtual tree-like computing structure, performing, by a hardware processor, a node failure prediction model to calculate a mean time between failure (MTBF) associated with the computing node, determining whether to perform a checkpoint of the computing node based on a comparison between the calculated MTBF and a maximum and minimum threshold, migrating a process from the computing node to a different computing node acting as a recovery node, and resuming execution of the process on the different computing node.
申请公布号 CA2956567(A1) 申请公布日期 2016.02.04
申请号 CA20152956567 申请日期 2015.07.20
申请人 SAUDI ARABIAN OIL COMPANY 发明人 AL-WAHABI, KHALID S.
分类号 G06F11/07;G06F11/14 主分类号 G06F11/07
代理机构 代理人
主权项
地址