Title of invention: Multi-stream deduplicated backup of collaboration server data
Abstract: Techniques to back up collaboration server data are disclosed. An indication to begin backup of a collaboration server dataset is received. An associated directory is walked in a prescribed order to divide the dataset into a prescribed number of approximately equal-sized subsets. A separate subset-specific thread is used for each subset, so that the subsets are backed up in parallel. In some embodiments in which the collaboration data is stored in multiple volumes, a volume-based approach is used to back up the volumes in parallel, e.g., one volume per thread. In some embodiments, transaction logs are backed up in parallel with volumes of collaboration data.
Publication number: US9165001(B1)
Publication date: 2015.10.20
Application number: US201213720814
Filing date: 2012.12.19
Applicant: EMC Corporation
Inventors: Upadhyay Navneet; Tadahal Manjunath
Classification: G06F17/30
Main classification: G06F17/30
Agency: Van Pelt, Yi & James LLP
Agent: Van Pelt, Yi & James LLP
Main claim: 1. A method of backing up data, comprising: receiving an indication to begin backup of a collaboration server dataset; walking an associated directory in a prescribed order to divide the dataset into a prescribed number of approximately equal-sized subsets, wherein the directory comprises a plurality of files; and using a separate subset-specific thread to back up the subsets in parallel; wherein each subset-specific backup thread is configured to provide data included in that subset to a backup-thread specific de-duplicating backup process instance configured to perform de-duplication processing with respect to the subset and a corresponding subset associated with a prior backup; and wherein the corresponding subset was determined by walking the associated directory in the prescribed order at a prior time with which the prior backup is associated.
Address: Hopkinton, MA, US
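
Illustrative sketch (not part of the patent record): the claimed method walks a directory in a prescribed order, divides it into a prescribed number of roughly equal-sized subsets, and backs up each subset in its own thread with de-duplication against the corresponding subset of a prior backup. The Python below is one possible reading of those steps under stated assumptions. The function names (`walk_sorted`, `partition`, `backup_subset`, `backup`) are hypothetical, the "prescribed order" is assumed to be lexicographic, and de-duplication is reduced to whole-file SHA-256 comparison, which stands in for whatever de-duplicating backup process the patent contemplates.

```python
import hashlib
import os
from concurrent.futures import ThreadPoolExecutor


def walk_sorted(root):
    """Yield (path, size) for every file under root in a fixed, sorted order."""
    for dirpath, dirnames, filenames in os.walk(root):
        dirnames.sort()  # sorting in place makes os.walk descend deterministically
        for name in sorted(filenames):
            path = os.path.join(dirpath, name)
            yield path, os.path.getsize(path)


def partition(files, n):
    """Split an ordered (path, size) sequence into n contiguous subsets of similar total size."""
    files = list(files)
    total = sum(size for _, size in files)
    subsets = [[] for _ in range(n)]
    running = 0
    for path, size in files:
        # Assign the file to the subset that its starting byte offset falls into.
        idx = min(n - 1, running * n // total) if total else 0
        subsets[idx].append(path)
        running += size
    return subsets


def backup_subset(paths, prior_digests):
    """Hash each file; report the ones whose content is absent from the prior subset.

    Assumption: whole-file hashing is a simplification of real de-duplication.
    """
    digests = {}
    changed = []
    for path in paths:
        with open(path, "rb") as fh:
            digest = hashlib.sha256(fh.read()).hexdigest()
        digests[path] = digest
        if digest not in prior_digests:
            changed.append(path)
    return digests, changed


def backup(root, n, prior=None):
    """Back up n subsets in parallel, one thread per subset.

    prior is a list of n digest sets from an earlier run (empty sets if none).
    """
    subsets = partition(walk_sorted(root), n)
    prior = prior or [set() for _ in range(n)]
    with ThreadPoolExecutor(max_workers=n) as pool:
        return list(pool.map(backup_subset, subsets, prior))
```

Because the walk order is deterministic, an unchanged directory yields the same subsets on every run, so subset i of the current backup lines up with subset i of the prior one. In this sketch, a caller would build the next run's `prior` argument as `[set(d.values()) for d, _ in results]`.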