Title of invention PERFORMING DE-DUPLICATION FOR AT LEAST ONE COMPUTER FILE IN A COMPUTER SYSTEM
Abstract The present invention provides a method and system for performing de-duplication of at least one computer file in a computer system. In an exemplary embodiment, the method and system include (1) tuning a rolling-hash algorithm for the de-duplication, (2) dividing the data in the file into chunks by using the tuned algorithm, (3) producing a content identifier for each of the chunks, and (4) processing the unique chunks, the content identifier of each unique chunk, and references to the unique chunks. In an exemplary embodiment, the computer system includes a de-duplication-enabled data store. In an exemplary embodiment, the computer system includes (a) a transferor computer system that is configured to transfer the file to a de-duplication-enabled computer system and (b) the de-duplication-enabled computer system.
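The four steps named in the abstract can be illustrated with a minimal sketch. It is not the patented method: the abstract does not specify the rolling-hash function, the tuning procedure, or the identifier scheme, so everything concrete here is an assumption. The sketch assumes a polynomial (Rabin-Karp style) rolling hash, treats "tuning" as simply picking the constants WINDOW, MASK, MIN_CHUNK and MAX_CHUNK, uses SHA-256 digests as content identifiers, and models the de-duplication-enabled data store as a plain dictionary. The function names chunk, deduplicate and restore are hypothetical.

```python
import hashlib

# Assumed tuning parameters (step 1); the abstract does not give real values.
WINDOW = 16            # number of bytes covered by the rolling hash
MASK = (1 << 6) - 1    # boundary when the low 6 hash bits are zero
MIN_CHUNK = 32         # never cut a chunk shorter than this
MAX_CHUNK = 256        # always cut once a chunk reaches this length
BASE = 257             # polynomial base of the rolling hash
MOD = (1 << 31) - 1    # modulus of the rolling hash


def chunk(data: bytes) -> list:
    """Step 2: split data at boundaries chosen by a rolling hash."""
    chunks = []
    start = 0                          # index where the current chunk begins
    h = 0                              # rolling hash of the current window
    top = pow(BASE, WINDOW - 1, MOD)   # weight of the byte leaving the window
    for i, b in enumerate(data):
        pos = i - start                # offset of this byte inside the chunk
        if pos >= WINDOW:
            # Window is full: drop the oldest byte before adding the new one.
            h = (h - data[i - WINDOW] * top) % MOD
        h = (h * BASE + b) % MOD
        length = pos + 1
        if (length >= MIN_CHUNK and (h & MASK) == 0) or length >= MAX_CHUNK:
            chunks.append(data[start:i + 1])
            start = i + 1
            h = 0                      # restart the hash for the next chunk
    if start < len(data):
        chunks.append(data[start:])    # trailing partial chunk
    return chunks


def deduplicate(data: bytes, store: dict) -> list:
    """Steps 3 and 4: identify each chunk and keep only unique ones.

    Returns the list of content identifiers (references) that
    describes the file in terms of stored chunks.
    """
    recipe = []
    for c in chunk(data):
        cid = hashlib.sha256(c).hexdigest()  # content identifier (assumed scheme)
        store.setdefault(cid, c)             # store the chunk only if it is new
        recipe.append(cid)                   # reference to the chunk
    return recipe


def restore(recipe: list, store: dict) -> bytes:
    """Rebuild the original file from its references."""
    return b"".join(store[cid] for cid in recipe)
```

In this sketch, calling deduplicate twice on the same bytes with the same store adds no new entries on the second call, since every chunk identifier is already present; only the list of references is produced again. In the transferor arrangement described in the abstract, a sender could presumably transmit the references and only those chunks the receiving store lacks, but that protocol is not detailed in the abstract and is not modeled here.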
Publication No. US2009276454(A1) Publication date 2009.11.05
Application No. US20080113136 Filing date 2008.04.30
Applicant INTERNATIONAL BUSINESS MACHINES CORPORATION Inventor SMITH MARK ANDREW
Classification G06F17/30 Main classification G06F17/30
Agency Agent
Principal claim
Address