发明名称 Data depulication using short term history
摘要 Exemplary system, and computer program product embodiments for data deduplication using short term history in a computing environment are provided. In one embodiment, by way of example only, a hash value is calculated on data chunks for a read operation. The calculated hash value is stored in a storage media. The calculated hash value is looked up in the storage media to verify if a current write operation was previously written and/or read. Additional system and computer program product embodiments are disclosed and provide related advantages.
申请公布号 US8788468(B2) 申请公布日期 2014.07.22
申请号 US201213480194 申请日期 2012.05.24
申请人 International Business Machines Corporation 发明人 Amit Jonathan;Koifman Chaim
分类号 G06F7/00;G06F17/00 主分类号 G06F7/00
代理机构 Griffiths & Seaton PLLC 代理人 Griffiths & Seaton PLLC
主权项 1. A system for data deduplication using short term history in a computing environment, comprising: a processor device, operable in the computing storage environment and controls the system for the data deduplication, wherein the processor device is adapted for: calculating a hash value on data chunks for a read operation of the data chunks from a storage system, wherein the calculated hash value for the read operation of the data chunks from a storage system is stored in storage media;calculating an additional hash value for a current write operation to the storage system as part of the data deduplication;verifying if data chunks for the current write operation were one of a previously written operation and a previously read operation by looking up the calculated hash value in the storage media and comparing the calculated hash value for the read operation of the data chunks from the storage system to the additional hash value of the data chunks for the current write operation as part of the data deduplication; andsaving, in the storage system, only those portions of the data chunks that were updated or changed during the current write operation if the calculated hash value for the read operation of the data chunks from the storage system matches the additional hash value of the data chunks for the current write operation as part of the data deduplication, wherein the saved portions of the data chunks that were updated or changed do not constitute a new file.
地址 Armonk NY US