Title of invention |
Data deduplication in a removable storage device |
Abstract |
An apparatus and associated methodology contemplate a data storage system having a removable storage device operably transferring user data between the data storage system and another device via execution of a plurality of input/output commands. A commonality factoring module executes computer instructions stored in memory to assign commonality information to the user data. A deduplication module executes computer instructions stored in memory to combine a plurality of files of the user data (user data files) with at least one file of corresponding commonality information (commonality information file), the combined files forming a sequential data stream. |
Publication number |
US9396204(B2) |
Publication date |
2016.07.19 |
Application number |
US201213725088 |
Filing date |
2012.12.21 |
Applicant |
Spectra Logic |
Inventor |
Starr Matthew Thomas |
Classification number |
G06F12/00;G06F17/30 |
Main classification number |
G06F12/00 |
Agency |
|
Agent |
Altshuler Kenneth |
Principal claim |
1. A method comprising:
providing a data storage system possessing a tape drive, a tape cartridge containing a tape medium, and a commonality factoring module;
transferring user data from a client to the data storage system via execution of a plurality of input/output transfers;
determining a first portion and a second portion of the user data;
performing first commonality factoring to the first portion of the user data to create a first commonality factor and second commonality factoring to the second portion of the user data to create a second commonality factor;
after the performing step, assigning first commonality information to the first portion of the user data and second commonality information to the second portion of the user data;
comparing the first commonality factor and the second commonality factor to a database of pre-existing commonality factors;
establishing the second commonality factor exists in the database of pre-existing commonality factors;
streaming the first portion of the user data and the second commonality information into a sequential data stream that is devoid of the second portion of the user data; and
storing the sequential data stream to the tape medium via the tape drive. |
Address |
Boulder CO US |
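
The method of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the record does not say how commonality factors are computed or how records are laid out on tape. The sketch assumes fixed-size portions, a SHA-256 digest as the commonality factor, an in-memory set as the database of pre-existing factors, an `io.BytesIO` object standing in for the tape medium, and hypothetical one-byte record tags (`D` for data, `R` for a reference). All function names are likewise hypothetical.

```python
import hashlib
import io
import struct

PORTION_SIZE = 4  # assumed fixed portion size, kept tiny for illustration


def commonality_factor(portion: bytes) -> bytes:
    """Assumed factoring step: the SHA-256 digest of the portion."""
    return hashlib.sha256(portion).digest()


def build_stream(user_data: bytes, known_factors: set, size: int = PORTION_SIZE) -> list:
    """Build a sequential stream that omits portions whose factor is already known."""
    stream = []
    for start in range(0, len(user_data), size):
        portion = user_data[start:start + size]
        factor = commonality_factor(portion)
        if factor in known_factors:
            # Factor already in the database: stream only the commonality information.
            stream.append(("REF", factor))
        else:
            # New factor: record it and stream the portion itself.
            known_factors.add(factor)
            stream.append(("DATA", factor, portion))
    return stream


def write_to_tape(stream: list, tape) -> None:
    """Append each record in order to a sequential, write-only medium."""
    for record in stream:
        if record[0] == "DATA":
            _, factor, portion = record
            tape.write(b"D" + factor + struct.pack(">I", len(portion)) + portion)
        else:
            tape.write(b"R" + record[1])


# The database already holds the factor for b"BBBB", so that portion is not streamed.
known = {commonality_factor(b"BBBB")}
stream = build_stream(b"AAAABBBB", known)
tape = io.BytesIO()  # stand-in for the tape medium
write_to_tape(stream, tape)
```

In this run the first portion (`b"AAAA"`) is written in full, while the second portion (`b"BBBB"`) is represented only by its 32-byte factor, so the stream carries the second commonality information but not the second portion of user data.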