发明名称 DATA DE-DUPLICATION FOR DISK IMAGE FILES
摘要 The invention relates to a data processing system, comprising at least two disk emulators operating in parallel and emulating a disk subsystem each, the disk emulators each using a file in a file system for any data stored on the respective disk, a separate de-duplicator for de-duplicating the data stored in the files, the de-duplicator operating in parallel to the disk emulators, the de-duplicator further using an additional disk emulator emulating an additional disk subsystem by using an additional file in a file system for storing data shared between the other disk subsystems.
申请公布号 US2014214776(A1) 申请公布日期 2014.07.31
申请号 US201414165816 申请日期 2014.01.28
申请人 International Business Machines Corporation 发明人 Bolte Dirk;Elfering Herwig;Heisser Susanne Friedhilde;Nowak Juergen;Pohl Thomas;Troester Martin
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A data processing system, comprising: at least two disk emulators operating in parallel and each emulating a disk subsystem, the disk emulators using a file in a file system for any data stored on the respective disk; a separate de-duplicator for de-duplicating the data stored in the files, the de-duplicator operating in parallel to the disk emulators, the de-duplicator further using an additional disk emulator emulating an additional disk subsystem by using an additional file in a file system for storing data shared between the other disk subsystems; andwherein, the additional file is accessible by all disk emulators;the de-duplicator comprising: duplication identification means for communicating with the disk emulators to identify duplicated data in the respective files; storing means responsive to the duplication identification means to retrieve duplicated data from the files and store it in the additional file via the additional disk emulator; linking means responsive to the storing means to instruct a disk emulator to delete the retrieved duplicated data in its file and replace it with a reference to the duplicated data in the additional file; and upon receiving an instruction to delete and replace data from the de-duplicator a disk emulator performing an atomic operation to delete the respective data and replacing it with a reference to the respective data in the additional file.
地址 Armonk NY US