发明名称 System and method for recovering system status consistently to designed recovering time point in distributed database
摘要 A method for recovering system status consistently to a designed recovering time point in a distributed database, wherein the distributed database comprises a primary server and multiple region servers, comprising: when the region servers detect a change of system status, analyzing an event for the change to generate an event log and storing the event log to the database; after a preset condition, the region servers generate a snapshot respectively; when the primary server receives the instruction to recover the system status to a designed time point, indicates the region servers to implement: reading the event log and snapshots stored in region servers; finding the snapshot closest to the designed time point; finding the event log and snapshot corresponding to an time interval between the time recorded in the snapshot and the designed time point to recover the system status to the designed time point.
申请公布号 US9372765(B2) 申请公布日期 2016.06.21
申请号 US201414271501 申请日期 2014.05.07
申请人 NATIONAL TSING HUA UNIVERSITY 发明人 Hsiao Hung-Chang;Liao Chi-Tsun;Tsai Chia-Ping;Chung Yeh-Ching
分类号 G06F17/00;G06F11/14 主分类号 G06F17/00
代理机构 Birch, Stewart, Kolasch & Birch, LLP 代理人 Birch, Stewart, Kolasch & Birch, LLP
主权项 1. A method for recovering a system status of a primary server and multiple region servers consistently to a designed recovering time point in a distributed database having the primary server and the multiple region servers, comprising: when one of the multiple region servers detects every change of the system status, the region server analyzing an event of the change of the system status to generate an event log, and storing the event log to the database, wherein the event logs respectively have a time vector to determine an occurring sequence of the events; after reaching a preset condition the region servers respectively generating a snapshot, wherein the snapshot is a formatted system data file; and when the primary server receives an instruction for recovering the system status to a designed recovering time point, the primary server ordering the region servers to execute the following steps: step 1: obtaining the event logs and the snapshots stored in the region servers;step 2: finding out the snapshots which are closest to the designed recovering time point; andstep 3: finding out corresponding events according to a time interval between a time of recording the snapshot and the designed recovering time point, and recovering the system data file recorded by the snapshot to recover the system status back to the system status of the designed recovering time point according to the corresponding events, wherein the time vector comprises a local code and a global code, and wherein, the step 3 comprises the following steps: determining the time interval between the time of recording the snapshot and the designed recovering time point;determining occurring time of the event in the distributed database according to the global code of the event log, and determining the occurring sequence of the event in a time corresponding to the global code according to the local code of the event log; andrecovering the system status to the designed recovering time point according to the snapshot, the sequence of finding out and the events.
地址 Hsinchu TW