发明名称 Deduplication of virtual machine content
摘要 Methods and systems for managing, storing, and serving data within a virtualized environment are described. In some embodiments, a data management system may manage the extraction and storage of virtual machine snapshots, provide near instantaneous restoration of a virtual machine or one or more files located on the virtual machine, and enable secondary workloads to directly use the data management system as a primary storage target to read or modify past versions of data. The data management system may allow a virtual machine snapshot of a virtual machine stored within the system to be directly mounted to enable substantially instantaneous virtual machine recovery of the virtual machine.
申请公布号 US9569124(B2) 申请公布日期 2017.02.14
申请号 US201514628022 申请日期 2015.02.20
申请人 RUBRIK, INC. 发明人 Jain Arvind;Mazumdar Soham;Botelho Fabiano
分类号 G06F21/00;H04L9/32;G06F3/06;G06F9/455;G06F9/50;G06F11/14;G06F17/30;H04L12/26;H04L29/12;H04L29/08;G06F11/20 主分类号 G06F21/00
代理机构 Vierra Magen Marcus LLP 代理人 Vierra Magen Marcus LLP
主权项 1. A method for operating a data management system, comprising: acquiring a first snapshot of a first virtual machine, the first snapshot comprises a full image of the first virtual machine; generating a signature for the first virtual machine using the full image, the generating a signature includes generating a plurality of hash values corresponding with a plurality of data blocks within the full image, the plurality of data blocks is arranged such that data blocks of a first plurality of the plurality of data blocks are spaced at a fixed distance from each other and data blocks of a second plurality of the plurality of data blocks are spaced at monotonically increasing distances from each other; identifying a second virtual machine based on the signature, the second virtual machine is associated with a base image; generating a dependent base file associated with the first snapshot using the full image and the base image, the dependent base file comprises data differences between the first snapshot of the first virtual machine and a second snapshot of the second virtual machine corresponding with the base image, the first plurality corresponds with a first data region within the full image of the first virtual machine, the second plurality corresponds with a second data region within the full image of the first virtual machine that does not overlap with the first data region; and storing the dependent base file.
地址 Palo Alto CA US