Title of invention: Data deduplication using short term history
Abstract: Exemplary embodiments for data deduplication using short term history in a computing environment are provided. In one embodiment, by way of example only, a hash value is calculated on data chunks for a read operation. The calculated hash value is stored in a storage media. The calculated hash value is looked up in the storage media to verify if a current write operation was previously written and/or read. Additional system and computer program product embodiments are disclosed and provide related advantages.
Publication number: US8762352(B2) Publication date: 2014.06.24
Application number: US201313830313 Filing date: 2013.03.14
Applicant: International Business Machines Corporation Inventors: Amit Jonathan;Koifman Chaim
Classification: G06F7/00;G06F17/00 Main classification: G06F7/00
Agency: Griffiths & Seaton PLLC Agent: Griffiths & Seaton PLLC
Principal claim: 1. A method for data deduplication using short term history by a processor device in a computing environment, the method, using the processor device, comprising: calculating a hash value on data chunks for a read operation of the data chunks from a storage system, wherein the calculated hash value for the read operation of the data chunks from a storage system is stored in storage media; calculating an additional hash value for a current write operation to the storage system as part of the data deduplication; verifying if the data chunks for the current write operation were one of a previously written operation and a previously read operation by looking up the calculated hash value in the storage media and comparing the calculated hash value for the read operation of the data chunks from the storage system to the additional hash value for the current write operation as part of the data deduplication; and saving, in the storage system, only those portions of the data chunks that were updated or changed in the data chunks during the current write operation if the calculated hash value for the read operation of the data chunks from the storage system matches the additional hash value for the write operation, wherein the saved portions of the data chunks that were updated or changed do not constitute a new file.
Address: Armonk NY US
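
The steps recited in the principal claim (hash chunks on read, keep the hashes in a short-term history, hash chunks again on write, and store only the chunks whose hashes are not found) can be illustrated with a minimal Python sketch. This is an illustration under stated assumptions, not the patented implementation: the names `ShortTermHistory`, `DedupStore`, and `CHUNK_SIZE`, the fixed-size chunking, the SHA-256 hash, and the least-recently-used eviction policy are all hypothetical choices that the record does not specify.

```python
import hashlib
from collections import OrderedDict

CHUNK_SIZE = 4  # assumption: tiny fixed-size chunks, chosen only to keep the demo readable


class ShortTermHistory:
    """Bounded map from chunk hash to chunk id (hypothetical LRU policy)."""

    def __init__(self, capacity=1024):
        self.capacity = capacity
        self._entries = OrderedDict()  # digest -> chunk id, oldest first

    def record(self, digest, chunk_id):
        self._entries[digest] = chunk_id
        self._entries.move_to_end(digest)
        while len(self._entries) > self.capacity:
            self._entries.popitem(last=False)  # evict the least recently used hash

    def lookup(self, digest):
        chunk_id = self._entries.get(digest)
        if chunk_id is not None:
            self._entries.move_to_end(digest)  # a hit refreshes the entry
        return chunk_id


class DedupStore:
    """Toy storage system that deduplicates writes against recent reads and writes."""

    def __init__(self, history):
        self.history = history
        self.chunks = {}  # chunk id -> bytes actually stored
        self.files = {}   # file name -> ordered list of chunk ids
        self._next_id = 0

    def read(self, name):
        parts = []
        for chunk_id in self.files[name]:
            chunk = self.chunks[chunk_id]
            # Hash each chunk on the read path and remember it in the history.
            self.history.record(hashlib.sha256(chunk).digest(), chunk_id)
            parts.append(chunk)
        return b"".join(parts)

    def write(self, name, data):
        """Write data and return how many chunks had to be newly stored."""
        ids, stored = [], 0
        for start in range(0, len(data), CHUNK_SIZE):
            chunk = data[start:start + CHUNK_SIZE]
            digest = hashlib.sha256(chunk).digest()
            chunk_id = self.history.lookup(digest)
            if chunk_id is None:
                # Hash not seen recently: treat the chunk as changed and store it.
                chunk_id = self._next_id
                self._next_id += 1
                self.chunks[chunk_id] = chunk
                stored += 1
            self.history.record(digest, chunk_id)
            ids.append(chunk_id)
        self.files[name] = ids
        return stored


store = DedupStore(ShortTermHistory(capacity=8))
first = store.write("a", b"aaaabbbbcccc")   # every chunk is new
store.read("a")                             # read path refreshes the history
second = store.write("a", b"aaaaXXXXcccc")  # only the middle chunk changed
```

In this sketch a hash that has been evicted from the history simply causes the chunk to be stored again, so the bounded history trades some missed deduplication for a small lookup structure. The sketch also trusts a hash match outright; a production system would likely compare the chunk bytes as well before treating two chunks as identical.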