摘要 |
A system and method for configuring a cluster of computer nodes to save and restore state in the cluster in the event of node failures. The system and method are implemented through an application programming interface that includes a membership application, a locks application and a dataspace application. The membership application maintains a set of nodes in the cluster. The lock application provides a means for service applications running on the nodes to synchronize access to dataspaces. The dataspaces provide a cluster-wide shared regions in the memory of the cluster members. The API is configured to monitor the cluster members and to coordinate reallocation of a service application if a node running the service application fails.
|