发明名称 Preserving redundancy in data deduplication systems by designation of virtual address
摘要 Various embodiments for preserving data redundancy of identical data in a data deduplication system in a computing environment are provided. A selected range of virtual addresses of a virtual storage device in the computing environment is designated as not subject to a deduplication operation. Other system and computer program product embodiments are disclosed and provide related advantages.
申请公布号 US9262428(B2) 申请公布日期 2016.02.16
申请号 US201213453266 申请日期 2012.04.23
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 Fiske Rahul M.;Jones Carl Evan;Roy Subhojit
分类号 G06F17/30;G06F3/06 主分类号 G06F17/30
代理机构 Griffiths & Seaton PLLC 代理人 Griffiths & Seaton PLLC
主权项 1. A system for preserving data redundancy of identical data in a data deduplication system in a computing environment, comprising: a host computing device including a host application; a storage device comprising a plurality of physical disks coupled to the host computing device, the storage device further comprising a virtual storage device comprising a plurality of virtual addresses; and a storage controller coupled to the storage device and operable in the computing environment, wherein the storage controller is configured for: implementing a general storage policy including a deduplication operation for preventing multiple copies of the same data from being stored,designating a first range of virtual addresses and a second range of virtual addresses in the virtual storage device as not being subject to the deduplication operation,designating a remainder of the virtual addresses in the virtual storage device as being subject to the deduplication operation, andwherein the host application is configured for: generating a plurality of data blocks,implementing a specific storage policy for each respective data block generated by the host application so that at least a portion of the data blocks generated by the host application are not subject to the general storage policy, each specific storage policy implemented by:determining an importance of each generated data block,determining whether to store each respective data block in one or more of the first range of virtual addresses, the second range of virtual addresses, and the remainder of the virtual addresses based on the importance of each respective data block,storing important data blocks to one or more of the first range of virtual addresses and the second range of virtual addresses so that the data blocks generated by the host application and stored in the first range of virtual addresses and the second range of virtual addresses are not subject to data deduplication, andstoring non-important data block to the remainder of the virtual addresses so that the data blocks generated by and stored in the remainder of the virtual addresses are subject to data deduplication.
地址 Armonk NY US