发明名称 |
DATA DE-DUPLICATION FOR DISK IMAGE FILES |
摘要 |
The invention relates to a data processing system, comprising at least two disk emulators operating in parallel and emulating a disk subsystem each, the disk emulators each using a file in a file system for any data stored on the respective disk, a separate de-duplicator for de-duplicating the data stored in the files, the de-duplicator operating in parallel to the disk emulators, the de-duplicator further using an additional disk emulator emulating an additional disk subsystem by using an additional file in a file system for storing data shared between the other disk subsystems. |
申请公布号 |
US2014214776(A1) |
申请公布日期 |
2014.07.31 |
申请号 |
US201414165816 |
申请日期 |
2014.01.28 |
申请人 |
International Business Machines Corporation |
发明人 |
Bolte Dirk;Elfering Herwig;Heisser Susanne Friedhilde;Nowak Juergen;Pohl Thomas;Troester Martin |
分类号 |
G06F17/30 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
1. A data processing system, comprising:
at least two disk emulators operating in parallel and each emulating a disk subsystem, the disk emulators using a file in a file system for any data stored on the respective disk; a separate de-duplicator for de-duplicating the data stored in the files, the de-duplicator operating in parallel to the disk emulators, the de-duplicator further using an additional disk emulator emulating an additional disk subsystem by using an additional file in a file system for storing data shared between the other disk subsystems; andwherein,
the additional file is accessible by all disk emulators;the de-duplicator comprising:
duplication identification means for communicating with the disk emulators to identify duplicated data in the respective files; storing means responsive to the duplication identification means to retrieve duplicated data from the files and store it in the additional file via the additional disk emulator; linking means responsive to the storing means to instruct a disk emulator to delete the retrieved duplicated data in its file and replace it with a reference to the duplicated data in the additional file; and upon receiving an instruction to delete and replace data from the de-duplicator a disk emulator performing an atomic operation to delete the respective data and replacing it with a reference to the respective data in the additional file. |
地址 |
Armonk NY US |