发明名称 Distributed computing backup and recovery system
摘要 The distributed computing backup and recovery (DCBR) system and method provide backup and recovery for distributed computing models (e.g., NoSQL). The DCBR system extends the protections from server node-level failure and introduces persistence in time so that the evolving data set may be stored and recovered to a past point in time. The DCBR system, instead of performing backup and recovery for an entire dataset, may be configured to apply to a subset of data. Instead of keeping or recovering snapshots of the entire dataset which requires the entire cluster, the DCBR system identifies the particular nodes and/or archive files where the dataset resides so that backup or recovery may be done with a much smaller number of nodes.
申请公布号 US8930320(B2) 申请公布日期 2015.01.06
申请号 US201113250901 申请日期 2011.09.30
申请人 Accenture Global Services Limited 发明人 Tung Teresa;Farooqui Sameer;Richter Owen E.
分类号 G06F17/30;G06F11/14 主分类号 G06F17/30
代理机构 Brinks Gilson & Lione 代理人 Brinks Gilson & Lione
主权项 1. A method for distributed computing backup and recovery, comprising: retrieving at least one user selectable preference; identifying a first subset of data from within a data set according to the at least one user selectable preference, the first subset of data containing less than all of the data in the data set wherein the first subset is selectable by using the user selectable preference; receiving, into a memory via an interface controlled by a processor connected to a network in a computing environment wherein the identified data objects are within a second subset of data, the second subset of data containing less than all of the data in the first subset of data; evaluating, using the processor, a hash function stored in the memory to determine network storage locations or network retrieval locations, or both for the data objects; storing at a granular level, at each of the network storage locations, the data objects according to the data object request, when the data object request comprises a request to store the data objects, where the stored data objects are identified as a replica of the data objects stored at each of the network storage locations; retrieving a hash seed for an identified time used to recreate the hash function for the identified time, where the data object request comprises the identified time to retrieve the data objects;where the hash seed is a random function based on an initial seed, where the hash seed was previously stored for the identified time; recreating, using the hash seed, the hash function for the identified time; retrieving a backup record identifier from a backup log file corresponding to the data object request; applying the hash function to the backup record identifier, where the hash function identifies the network retrieval locations in the computing environment the data objects are stored; identifying the data objects within the second subset of data; retrieving at a granular level from one of the network retrieval locations from a backup of the computing environment, using the processor connected to the network, the stored data objects identified by the one of the network retrieval locations, when the data object request comprises a request to retrieve the data objects, where the stored data objects are retrieved from the second subset of data; retrieving one of the replica of the data objects from a node where the hash function determines where the stored data objects are located of the network retrieval locations; and returning the data objects responsive to the request.
地址 Dublin IE