发明名称 EXTENSIBLE PIPELINE FOR DATA DEDUPLICATION
摘要 The subject disclosure is directed towards data deduplication (optimization) performed by phases / modules of a modular data deduplication pipeline. At each phase, the pipeline allows modules to be replaced, selected or extended, e.g., different algorithms can be used for chunking or compression based upon the type of data being processed. The pipeline facilitates secure data processing, batch processing, and parallel processing. The pipeline is tunable based upon feedback, e.g., by selecting modules to increase deduplication quality, performance and/or throughput. Also described is selecting, filtering, ranking, sorting and/or grouping the files to deduplicate, e.g., based upon properties and/or statistical properties of the files and/or a file dataset and/or internal or external feedback.
申请公布号 WO2012083262(A2) 申请公布日期 2012.06.21
申请号 WO2011US65657 申请日期 2011.12.16
申请人 MICROSOFT CORPORATION 发明人 OLTEAN, PAUL ADRIAN;KALACH, RAN;EL-SHIMI, AHMED M.;BENTON, JAMES ROBERT
分类号 G06F9/38;G06F11/14;G06F12/00 主分类号 G06F9/38
代理机构 代理人
主权项
地址