发明名称 Method and apparatus for offloading compute resources to a flash co-processing appliance
摘要 Solid-State Drive (SSD) burst buffer nodes are interposed into a parallel supercomputing cluster to enable fast burst checkpoint of cluster memory to or from nearby interconnected solid-state storage with asynchronous migration between the burst buffer nodes and slower more distant disk storage. The SSD nodes also perform tasks offloaded from the compute nodes or associated with the checkpoint data. For example, the data for the next job is preloaded in the SSD node and very fast uploaded to the respective compute node just before the next job starts. During a job, the SSD nodes perform fast visualization and statistical analysis upon the checkpoint data. The SSD nodes can also perform data reduction and encryption of the checkpoint data.
申请公布号 US9158540(B1) 申请公布日期 2015.10.13
申请号 US201213676019 申请日期 2012.11.13
申请人 EMC Corporation;Los Alamos National Security, LLC 发明人 Tzelnic Percy;Faibish Sorin;Gupta Uday K.;Bent John;Grider Gary Alan;Chen Hsing-bung
分类号 G06F7/38;G06F11/00;G06F9/30 主分类号 G06F7/38
代理机构 Novak Druce Connolly Bove + Quigg LLP 代理人 Auchterlonie Richard;Novak Druce Connolly Bove + Quigg LLP
主权项 1. A parallel supercomputing cluster comprising: compute nodes interconnected in a mesh of data links for executing a Message Passing Interface (MPI) job using MPI data transfer between the computer nodes over the mesh of data links; and solid-state storage nodes each linked to a respective group of the compute nodes for receiving checkpoint data from the respective compute nodes, and magnetic disk storage linked to each of the solid-state storage nodes for asynchronous migration of the checkpoint data from the solid-state storage nodes to the magnetic disk storage; wherein each solid-state storage node includes a data processor coupled to the respective group of compute nodes for receiving the checkpoint data from the respective group of compute nodes and coupled to the magnetic disk storage for transmitting the checkpoint data to the magnetic disk storage, solid state storage coupled to the data processor for buffering the checkpoint data, and non-transitory computer readable storage medium storing computer instructions that, when executed by the data processor, perform the steps of: (a) presenting a file system interface to the MPI job, and multiple MPI processes of the MPI job writing the checkpoint data to a shared file in the solid-state storage in a strided fashion in a first data layout; (b) asynchronously migrating the checkpoint data from the shared file in the solid-state storage to the magnetic disk storage and writing the checkpoint data to the magnetic disk storage in a sequential fashion in a second data layout; and (c) performing additional tasks offloaded from the compute nodes or associated with the checkpoint data.
地址 Hopkinton MA US