发明名称 Managing dereferenced chunks in a deduplication system
摘要 A chunk index has information on chunks in a storage space referenced in objects in the storage space. The chunk index includes a reference count for each chunk indicating a number of objects in which the chunk is referenced and a reference measurement representing a level of data object references to the chunk. One chunk is selected to remove from the storage space based on a criteria applied to the reference measurements of chunks having reference counts indicating that the chunks are not referenced in one object in the storage space.
申请公布号 US8775390(B2) 申请公布日期 2014.07.08
申请号 US201213477908 申请日期 2012.05.22
申请人 International Business Machines Corporation 发明人 Anglin Matthew J.;Cannon David M.;Dawson Colin S.;Elder Robert S.
分类号 G06F17/00;G06F17/30 主分类号 G06F17/00
代理机构 Konrad, Raynes, Davda and Victor LLP 代理人 Victor David W.;Konrad, Raynes, Davda and Victor LLP
主权项 1. A implemented-complemented method for maintaining, by a processor, data objects in a storage space, comprising: maintaining a chunk index having information on chunks in the storage space referenced in data objects, wherein the chunk index includes a reference count for each chunk indicating a number of the data objects in which the chunk is referenced and a reference measurement representing a level of the data objects references to the chunk; selecting one chunk to remove from the storage space based on a criteria applied to the reference measurements of chunks having reference counts indicating that the chunks are not referenced in one of the data objects in the storage space, wherein the reference measurement of each chunk comprises a time most recently dereferenced indicating a time when the reference count for the chunk was decremented to indicate that the chunk is not referenced in one of the data objects; and returning indication of the selected chunk to remove from the storage space.
地址 Armonk NY US