摘要 |
A multiple node storage system cluster that allows for a takeover by a takeover node simultaneously with a failing node resetting its storage adapters is provided. A takeover monitor on the failing node initiates a “coredump” procedure by selecting a coredump disk. After selecting the coredump disk, the failing node determines the world wide name (WWN) of that disk and sends this information in a message across the cluster interconnect to the takeover node. In response to receipt of this message, the takeover node begins takeover procedures with respect to all disks except for the coredump disk. The failing node simultaneously resets its storage adapters and writes is memory to the coredump disk. The failing node later updates a completion header on that disk. The takeover node completes the takeover without waiting for the storage adapter reset, and subsequently reads the completion header and copies coredump information into its memory.
|