Title of invention |
Communication link fault tolerance in a supercomputer |
Abstract |
A method is disclosed of operating a supercomputer having a plurality of computing elements, each connected to a fast communications link. The method comprises the steps of: scheduling specified elements to perform computing tasks in specified cycles of a computing operation; in the event of failure of a fast communications link in a given cycle, transferring state from a disabled element, which is no longer able to communicate as a result of the failure, to an idle element not scheduled to perform a task in the given cycle; and operating the idle element to perform any uncompleted tasks that were scheduled for the disabled element and remain in the cycle. |
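The three steps in the abstract (schedule, transfer state on link failure, run the remaining tasks on an idle element) can be illustrated with a minimal sketch. It is not taken from the patent: the `Element` class, the `fail_over` function, the shape of `schedule`, and the task names are all hypothetical, and the abstract does not say how state physically moves once the fast link has failed, so the sketch models the transfer as a plain in-memory copy.

```python
from dataclasses import dataclass, field


@dataclass
class Element:
    """One computing element. All names here are hypothetical."""
    name: str
    link_up: bool = True                          # state of its fast communications link
    state: dict = field(default_factory=dict)     # working state held by the element
    pending: list = field(default_factory=list)   # tasks not yet completed this cycle


def fail_over(elements, schedule, cycle, failed_name):
    """Move state and unfinished tasks from a disabled element to an idle one.

    `schedule` maps a cycle number to {element name: [task, ...]}.
    Returns the name of the replacement element, or None if no element is idle.
    """
    disabled = elements[failed_name]
    disabled.link_up = False
    busy = schedule.setdefault(cycle, {})
    # An idle element has a working link and no task scheduled in this cycle.
    idle = next(
        (e for e in elements.values() if e.link_up and e.name not in busy),
        None,
    )
    if idle is None:
        return None
    # Assumption: the transfer is modelled as a copy; the real path is unspecified.
    idle.state = dict(disabled.state)
    idle.pending = list(disabled.pending)
    disabled.pending.clear()
    # Record the replacement in the schedule so it is no longer treated as idle.
    busy[idle.name] = busy.pop(failed_name, [])
    return idle.name


# Usage: e0 loses its link in cycle 0 with task "t1" still outstanding.
elements = {n: Element(n) for n in ("e0", "e1", "e2")}
schedule = {0: {"e0": ["t0", "t1"], "e1": ["t2"]}}
elements["e0"].state = {"partial_sum": 42}
elements["e0"].pending = ["t1"]
replacement = fail_over(elements, schedule, 0, "e0")
```

In this run `e1` is already scheduled in cycle 0, so the unscheduled element `e2` is chosen and inherits both the copied state and the outstanding task. Picking the first idle element is only one possible policy; a real scheduler might prefer an element that is topologically close to the failed one.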
Publication number |
GB2419696(B) |
Publication date |
2008.07.16 |
Application number |
GB20040024125 |
Filing date |
2004.10.29 |
Applicant |
HEWLETT-PACKARD DEVELOPMENT COMPANY, L.P. |
Inventors |
CHRISTOPHER TOFTS;RICHARD TAYLOR;JOHN WILLIAM LUMLEY |
Classification |
G06F11/20 |
Main classification |
G06F11/20 |
Agency |
|
Agent |
|
Principal claim |
|
Address |
|