Title of invention: Replication of deduplicated data
Abstract: Various embodiments for replicating deduplicated data using a processor device are provided. A block of the deduplicated data, created in a source repository, is assigned a global block identifier (ID) unique in a grid set inclusive of the source repository. The global block ID is generated using at least one unique identification value of the block, a containing grid of the grid set, and the source repository. The global block ID is transmitted from the source repository to a target repository. If the target repository determines the global block ID is associated with an existing block of the deduplicated data located within the target repository, the block is not transmitted to the target repository during a subsequent replication process.
Publication number: US8880482(B2)  Publication date: 2014.11.04
Application number: US201313732461  Filing date: 2013.01.02
Applicant: International Business Machines Corporation  Inventors: Akirav Shay H.;Aronovich Lior;Asher Ron;Bachar Yariv;Ish-Shalom Ariel J.;Leneman Ofer
Classification: G06F7/00;G06F17/00;G06F17/30;G06F11/20;G06F3/06  Main classification: G06F7/00
Agency: Griffiths & Seaton PLLC  Agent: Griffiths & Seaton PLLC
Principal claim: 1. A method for replicating deduplicated data, comprising: assigning a global block identifier to each of a plurality of blocks of deduplicated data in a source repository, wherein: each global block identifier is not dependent on the data contents in a probabilistic manner, each global block identifier is independent of a physical location of an associated block, each global block identifier comprises a block ID portion and at least one additional ID portion, each block ID portion is set to a value and each additional ID portion is set to a null value in the source repository, and each block ID portion is discarded after a single use; transmitting a plurality of global block identifiers from the source repository to a target repository, wherein said transmitting comprises adding a value to each additional ID portion such that both the block ID portion and each additional ID portion comprises a value when each global block identifier is transmitted from the source repository to the target repository; determining whether each global block identifier is associated with an existing block of the deduplicated data located within the target repository; partitioning the plurality of global block identifiers into a first portion of global block identifiers previously existing within the target repository and a second portion of global block identifiers previously non-existing within the target repository based on the determination; transmitting, by the target repository, the first portion of global block identifiers back to the source repository; and transmitting, by the source repository, data from each block of deduplicated data associated with the first portion of global block identifiers to the target repository in response to receipt of the first portion of the global block identifiers.
Address: Armonk NY US
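
The identifier exchange described in the abstract and principal claim can be illustrated with a minimal Python sketch. It is not the patented implementation. The names `GlobalBlockId`, `Repository`, `qualify`, `partition`, and `replicate` are hypothetical, and several details are assumptions: the block ID portion is a per-repository counter that is never reused, and the additional ID portions are a grid ID and a repository ID that stay null inside the source and are filled in at transmission. The sketch follows the abstract's behavior, in which a block whose global ID the target already holds is not sent again.

```python
from dataclasses import dataclass, replace
from typing import Dict, List, Optional, Tuple


@dataclass(frozen=True)
class GlobalBlockId:
    # Block ID portion: assumed to be a counter value, so it depends on
    # neither the block contents nor the block's physical location.
    block_id: int
    # Additional ID portions: null (None) while the block stays in its source.
    grid_id: Optional[int] = None
    repository_id: Optional[int] = None


class Repository:
    def __init__(self, grid_id: int, repository_id: int) -> None:
        self.grid_id = grid_id
        self.repository_id = repository_id
        self.blocks: Dict[GlobalBlockId, bytes] = {}
        self._next_block_id = 0  # each value is handed out once, never reused

    def create_block(self, data: bytes) -> GlobalBlockId:
        gid = GlobalBlockId(block_id=self._next_block_id)
        self._next_block_id += 1
        self.blocks[gid] = data
        return gid

    def qualify(self, gid: GlobalBlockId) -> GlobalBlockId:
        # Fill the additional portions for transmission; IDs that arrived
        # from another repository are already complete and pass through.
        if gid.grid_id is None:
            return replace(gid, grid_id=self.grid_id,
                           repository_id=self.repository_id)
        return gid

    def partition(
        self, ids: List[GlobalBlockId]
    ) -> Tuple[List[GlobalBlockId], List[GlobalBlockId]]:
        # Split received IDs into those already stored here and those not.
        existing = [g for g in ids if g in self.blocks]
        missing = [g for g in ids if g not in self.blocks]
        return existing, missing


def replicate(source: Repository, target: Repository) -> int:
    # Step 1: the source sends fully qualified IDs only, no block data.
    outgoing = {source.qualify(g): data for g, data in source.blocks.items()}
    # Step 2: the target partitions the IDs (assumption: it reports the
    # missing ones back to the source).
    _, missing = target.partition(list(outgoing))
    # Step 3: the source sends data only for blocks the target lacks.
    for g in missing:
        target.blocks[g] = outgoing[g]
    return len(missing)
```

Under these assumptions, a second call to `replicate` on an unchanged source transfers no block data, because every fully qualified ID is already present in the target.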