发明名称 DATA DEDUPLICATION STORAGE SYSTEM AND PROCESS
摘要 A data deduplication storage system and process is disclosed. In one implementation deduplication file storage system is added to an existing file storage system by receiving first files via a network from a remotely disposed computing device, dividing the files into data objects, creating hash values for the data objects, and store the data objects on more remotely disposed storage systems at network location addresses. Records of a storage table disposed on the intermediate device or a secondary remote storage system are stored for the data objects containing the hash values and corresponding network location addresses. When another of the files including one or more second data objects are received, a determination is made if the second data objects were previously stored on remotely disposed storage systems by comparing hash values for the second data object against hash values stored in records of the storage table.
申请公布号 US2017124107(A1) 申请公布日期 2017.05.04
申请号 US201615298897 申请日期 2016.10.20
申请人 StoreReduce 发明人 Emberson Mark Alexander Hugh
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A method to deduplicate file storage in a file storage system via a network, the method comprising: executing with a processor a first set of instructions stored in a memory device, the first set of instructions when executed by the processor: receive one or more first files via the network from the remotely disposed computing device; divide the one or more first files into one or more data objects;create one or more hash values for the one or more data objects;store the one or more data objects on one or more remotely disposed storage systems at one or more location addresses;store in one or more records of a storage table disposed on the intermediate device or a secondary remote storage system for each of the one or more data objects the one or more hash values and the one or more location addresses where the one or more data objects are stored;receive from the networked computing device via the network one or more second files; andin response to the receipt via the network from the networked computing device of the one or more second files including one or more second data objects, determine whether or not the one or more second data objects were previously stored on one or more remotely disposed storage systems by comparing one or more hash values for the one or more second data objects against the one or more hash values stored in one or more records of the storage table, andstoring via the network one or more second data objects, not previously stored on the one or more remotely disposed storage system, on the one or more remotely disposed storage systems in response to a determination that the comparing of one or more hash values for the second data object against the one or more hash values for the first data object and stored in one or more records of the storage table failed to indicate a match.
地址 Sunnyvale CA US