发明名称 Data deduplication in a removable storage device
摘要 An apparatus and associated methodology contemplate a data storage system having a removable storage device operably transferring user data between the data storage system and another device via execution of a plurality of input/output commands. A commonality factoring module executes computer instructions stored in memory to assign commonality information to the user data. A deduplication module executes computer instructions stored in memory to combine a plurality of files of the user data (user data files) with at least one file of corresponding commonality information (commonality information file), the combined files forming a sequential data stream.
申请公布号 US9396204(B2) 申请公布日期 2016.07.19
申请号 US201213725088 申请日期 2012.12.21
申请人 Spectra Logic 发明人 Starr Matthew Thomas
分类号 G06F12/00;G06F17/30 主分类号 G06F12/00
代理机构 代理人 Altshuler Kenneth
主权项 1. A method comprising: providing a data storage system possessing a tape drive, a tape cartridge containing a tape medium, and a commonality factoring module; transferring user data from a client to the data storage system via execution of a plurality of input/output transfers; determining a first portion and a second portion of the user data; performing first commonality factoring to the first portion of the user data to create a first commonality factor and second commonality factoring to the second portion of the user data to create a second commonality factor; after the performing step, assigning first commonality information to the first portion of the user data and second commonality information to the second portion of the user data; comparing the first commonality factor and the second commonality factor to a database of pre-existing commonality factors; establishing the second commonality factor exists in the database of pre-existing commonality factors; streaming the first portion of the user data and the second commonality information into a sequential data stream that is devoid of the second portion of the user data; and storing the sequential data stream to the tape medium via the tape drive.
地址 Boulder CO US