Title: Efficient data deployment for a parallel data processing system
Abstract: This document describes techniques for efficient data deployment for a parallel data processing system. In one embodiment, a virtualization platform running a parallel processing application that includes one or more virtual data nodes receives a first command to write a data block to a storage device. The platform then determines whether the first command was sent by a first virtual data node. If it was, the platform 1) writes the data block to a first location in the storage device; 2) returns the first location to the first virtual data node; and 3) determines whether the data should be replicated. If the data should be replicated, the platform instructs the storage device to copy the data block to a second location in the storage device and stores the second location in a tracking structure.
Publication number: US9582209(B2) Publication date: 2017.02.28
Application number: US201514748262 Filing date: 2015.06.24
Applicant: VMware, Inc. Inventors: Shih Chiao-Chuan; Nayak Samdeep
Classification: G06F3/06 Main classification: G06F3/06
Agency: Agent:
Main claim: 1. A method for deploying a data block comprising: at a virtualization platform running a parallel processing application that includes one or more virtual data nodes: receiving a first command to write a data block to a storage device; determining whether the first command was sent by a first virtual data node; and if the first command was sent by a first virtual data node: writing the data block to a first location in the storage device, returning the first location to the first virtual data node, determining whether the data should be replicated, and if the data should be replicated, instructing the storage device to internally make a copy of the data block to a second location in the storage device and storing the second location in a tracking structure.
Address: Palo Alto CA US
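
The write path recited in the main claim can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `StorageDevice`, `Platform`, `handle_write`, `copy_internal`, and `replication_enabled` are all hypothetical, the storage device is a toy in-memory list, and the replication decision is reduced to a single flag because the record does not say how that decision is made.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Set


class StorageDevice:
    """Toy in-memory device (assumption); a location is a slot index."""

    def __init__(self) -> None:
        self._slots: List[bytes] = []

    def write(self, block: bytes) -> int:
        """Store a block sent by the host and return its location."""
        self._slots.append(block)
        return len(self._slots) - 1

    def copy_internal(self, location: int) -> int:
        """Duplicate a block inside the device; the host does not resend it."""
        self._slots.append(self._slots[location])
        return len(self._slots) - 1

    def read(self, location: int) -> bytes:
        return self._slots[location]


@dataclass
class Platform:
    """Hypothetical stand-in for the virtualization platform."""

    device: StorageDevice
    virtual_data_nodes: Set[str]
    replication_enabled: bool = True  # placeholder replication policy
    # Tracking structure: first location -> locations of its copies.
    tracking: Dict[int, List[int]] = field(default_factory=dict)

    def handle_write(self, sender: str, block: bytes) -> Optional[int]:
        # Step 1: was the command sent by a known virtual data node?
        if sender not in self.virtual_data_nodes:
            return None  # other senders are outside the claim (assumption)
        # Step 2: write the block to a first location.
        first = self.device.write(block)
        # Step 3: decide on replication; if needed, ask the device to copy
        # the block internally and record the second location.
        if self.replication_enabled:
            second = self.device.copy_internal(first)
            self.tracking.setdefault(first, []).append(second)
        # The first location goes back to the caller. For simplicity this
        # sketch returns it after replication rather than before.
        return first


device = StorageDevice()
platform = Platform(device=device, virtual_data_nodes={"vdn-1"})
location = platform.handle_write("vdn-1", b"block-0")
print(location, platform.tracking)
```

The sketch shows the point of the claimed method: the platform sends the data block to the device only once, and the copy is made by the device itself, with the platform keeping only the location of the copy in its tracking structure.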