Title of invention: Scalable transport method for multicast replication
Abstract: Embodiments disclosed herein provide advantageous methods and systems that use multicast communications via unreliable datagrams sent on a protected traffic class. These methods and systems provide effectively reliable multicast delivery while avoiding the overhead associated with point-to-point protocols. Rather than an exponential scaling of point-to-point connections (with expensive setup and teardown of the connections), the traffic from one server is bounded by linear scaling of multicast groups. In addition, the multicast rendezvous disclosed herein creates an edge-managed flow control that accounts for the dynamic state of the storage servers in the cluster, without needing centralized control, management or maintenance of state. This traffic shaping avoids the loss of data due to congestion during sustained oversubscription. Other embodiments, aspects and features are also disclosed.
Publication number: US9338019(B2) Publication date: 2016.05.10
Application number: US201314095839 Filing date: 2013.12.03
Applicant: Nexenta Systems, Inc. Inventors: Bestler Caitlin;Novak Robert E.;Aizman Alexander
Classification: H04L12/18;H04L29/08;G06F17/30;H04L29/06;H04L12/801;G06F9/46 Main classification: H04L12/18
Agency: Okamoto & Benedicto LLP Agent: Okamoto & Benedicto LLP
Main claim: 1. A method of distributing a chunk which encodes data or object metadata within a cluster of storage servers, wherein distributing the chunk within the cluster of storage servers comprises performing a chunk put transaction, the method comprising: negotiating a rendezvous group by exchanging unreliable datagrams amongst an initiating client and a negotiating group to determine the rendezvous group, wherein the negotiating group comprises a subset of the storage servers, wherein said negotiating uses a cluster-consensus procedure where each member of the negotiating group evaluates delivery options for the chunk put transaction, wherein the delivery options are evaluated consistently by members of the negotiating group, and wherein said exchanging comprises multicasting the unreliable datagrams from the initiating client to the negotiating group and multicasting put accept responses from each storage server in the negotiating group to all other storage servers in the negotiating group; encoding the chunk in a sequence of unreliable datagrams; and multicasting the chunk by transmitting the sequence of unreliable datagrams in a rendezvous transfer to the rendezvous group, which is a multicast group, such that a single transmission of the sequence of the unreliable datagrams results in reception of the chunk by multiple members of the rendezvous group.
Address: Santa Clara CA US
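
Illustrative sketch (not part of the patent record): the chunk put transaction in the main claim can be pictured as three steps: every member of the negotiating group sees the same put accept responses, every member applies the same rule to them and so arrives at the same rendezvous group, and the chunk is then carried as a sequence of datagrams that each receiver reassembles. The Python below is a minimal in-memory simulation under stated assumptions, not the claimed implementation. The names PutAccept, earliest_start, select_rendezvous_group, encode_chunk and reassemble are hypothetical, as are the "earliest start time" bid, the tie-break on server id, the SHA-256 chunk identifier and the datagram layout; the record above specifies none of these, and no real multicast sockets are used.

```python
import hashlib
from dataclasses import dataclass


@dataclass(frozen=True)
class PutAccept:
    """Hypothetical put accept response multicast to the whole negotiating group."""
    server_id: str
    earliest_start: int  # assumed bid: earliest time this server could take the transfer


def select_rendezvous_group(accepts, replica_count):
    """Pick the rendezvous group with a deterministic rule.

    Every member sees the same responses, so sorting on a fixed key
    (bid, then server id as tie-break) gives the same answer everywhere
    without a central coordinator. The rule itself is an assumption.
    """
    ranked = sorted(accepts, key=lambda a: (a.earliest_start, a.server_id))
    return [a.server_id for a in ranked[:replica_count]]


def encode_chunk(chunk, payload_size):
    """Split a chunk into (chunk_id, seq, total, payload) datagrams."""
    chunk_id = hashlib.sha256(chunk).hexdigest()  # assumed identifier scheme
    total = max(1, -(-len(chunk) // payload_size))  # ceiling division, at least 1
    return [
        (chunk_id, seq, total, chunk[seq * payload_size:(seq + 1) * payload_size])
        for seq in range(total)
    ]


def reassemble(datagrams):
    """Rebuild a chunk from datagrams received in any order.

    Returns None if a datagram is missing or the content hash does not
    match, which is how a receiver would notice loss on an unreliable
    transport in this sketch.
    """
    if not datagrams:
        return None
    chunk_id, _, total, _ = datagrams[0]
    parts = {seq: payload for cid, seq, _, payload in datagrams if cid == chunk_id}
    if set(parts) != set(range(total)):
        return None
    chunk = b"".join(parts[i] for i in range(total))
    return chunk if hashlib.sha256(chunk).hexdigest() == chunk_id else None


if __name__ == "__main__":
    accepts = [PutAccept("s3", 5), PutAccept("s1", 2), PutAccept("s2", 2), PutAccept("s4", 9)]
    group = select_rendezvous_group(accepts, replica_count=3)
    datagrams = encode_chunk(b"x" * 2500, payload_size=1000)
    # One "transmission" of the datagram sequence is seen by every group member.
    received = {server: reassemble(list(reversed(datagrams))) for server in group}
    print(group, {s: len(c) for s, c in received.items()})
```

Because the selection rule depends only on the set of responses, the order in which a member happens to receive them does not change the outcome, which is one simple way to read the claim's requirement that delivery options be "evaluated consistently".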