Title of invention: Systems and methods for fault tolerant communications
Abstract: Apparatuses, systems and methods are disclosed for tolerating fault in a communications grid. Specifically, various techniques and systems are provided for detecting a fault or failure by a node in a network of computer nodes in a communications grid, adjusting the grid to avoid grid failure, and taking action based on the failure. In an example, a system may include receiving grid status information at a backup control node, the grid status information including a project status, storing the grid status information within the backup control node, receiving a failure communication including an indication that a primary control node has failed, designating the backup control node as a new primary control node, receiving updated grid status information based on the indication that the primary control node has failed, and transmitting a set of instructions based on the updated grid status information.
Publication number: US9424149(B2) Publication date: 2016.08.23
Application number: US201514747763 Filing date: 2015.06.23
Applicant: SAS INSTITUTE INC. Inventor: Knight Richard
Classification: G06F11/00;G06F11/20 Main classification: G06F11/00
Agency: Kilpatrick Townsend & Stockton, LLP Agent: Kilpatrick Townsend & Stockton, LLP
Principal claim: 1. A computer-program product tangibly embodied in a non-transitory machine-readable storage medium, including instructions configured to cause a data processing apparatus to: receive, at a backup control node connected to a primary control node and a worker node on a communications grid, grid status information, the grid status information including a project status of the primary control node or a project status of the worker node, wherein the project status of the primary control node and the project status of the worker node include a status of one or more portions of a project being executed by the primary and worker nodes in the communications grid; store the grid status information within the backup control node; receive a failure communication including an indication that the primary control node has failed; designate the backup control node as a new primary control node based on the failure communication upon receiving the failure communication; receive, at the backup control node, updated grid status information based on the indication that the primary control node has failed, wherein the updated grid status information includes an updated project status of the primary control node or an updated project status of the worker node; and transmit, by the backup control node, a set of instructions based on the updated grid status information, wherein the set of instructions includes instructions for the worker nodes to continue work on the project after failure of the primary control node.
Address: Cary NC US
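
The sequence recited in claim 1 (receive and store grid status, receive a failure communication, designate the backup as the new primary, receive updated status, transmit instructions) can be illustrated with a minimal Python sketch. This is not the patented implementation: the class BackupControlNode, its method names, the node identifiers, and the use of plain strings for project status are all hypothetical, and real network transport is replaced by direct method calls.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class BackupControlNode:
    """Hypothetical in-memory model of the backup control node in claim 1."""

    node_id: str
    is_primary: bool = False
    failed_primary: Optional[str] = None
    # Assumed representation: node id -> free-text project status.
    grid_status: Dict[str, str] = field(default_factory=dict)

    def receive_grid_status(self, status: Dict[str, str]) -> None:
        # Store the reported status; later reports overwrite earlier ones.
        self.grid_status.update(status)

    def receive_failure(self, failed_node_id: str) -> None:
        # Designate this node as the new primary upon the failure communication.
        self.failed_primary = failed_node_id
        self.is_primary = True

    def build_instructions(self) -> Dict[str, str]:
        # Only a node that has been designated primary may direct the workers.
        if not self.is_primary:
            raise RuntimeError("node has not been designated primary")
        # One instruction per surviving node, based on its latest status.
        return {
            node: "continue from: " + status
            for node, status in self.grid_status.items()
            if node != self.failed_primary
        }


if __name__ == "__main__":
    backup = BackupControlNode("backup-1")
    backup.receive_grid_status({"primary-1": "coordinating", "worker-1": "portion 1 done"})
    backup.receive_failure("primary-1")
    backup.receive_grid_status({"worker-1": "portion 2 in progress"})
    print(backup.build_instructions())
```

In this sketch the instructions are derived from the status received after the failure, which mirrors the claim's requirement that the transmitted instructions be based on the updated grid status information.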