Title of invention: Communication link fault tolerance in a supercomputer
Abstract: A method of operating a supercomputer having a plurality of computing elements, each connected to a fast communications link, is disclosed. The method comprises the steps of: scheduling specified elements to perform computing tasks in specified cycles of a computing operation; in the event of failure of a fast communications link in a given cycle, transferring state from a disabled element, which is no longer able to communicate as a result of the failure, to an idle element not scheduled to perform a task in the given cycle; and operating the idle element to perform any uncompleted tasks scheduled for the disabled element remaining in the cycle.
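The three steps in the abstract (schedule, transfer state on link failure, resume on an idle element) can be illustrated with a minimal Python sketch. This is not the patented implementation. The `Element` class, the `fail_over` function, the dictionary layout of the schedule, and the task names are all hypothetical. The sketch also assumes the disabled element's state is still reachable by some means other than the failed fast link, a detail the abstract does not specify.

```python
from dataclasses import dataclass, field


@dataclass
class Element:
    """Hypothetical computing element; field names are illustrative only."""
    ident: int
    state: dict = field(default_factory=dict)
    link_ok: bool = True  # False once the element's fast link has failed


def fail_over(elements, schedule, cycle, failed_id, completed):
    """Hand the failed element's unfinished tasks in `cycle` to an idle element.

    schedule:  {cycle: {element_id: [task, ...]}}
    completed: set of tasks the failed element finished before the failure.
    Returns the id of the element that took over, or None if none is idle.
    """
    busy = schedule[cycle]
    disabled = elements[failed_id]
    disabled.link_ok = False

    # An idle element has a working link and no task scheduled in this cycle.
    idle = next(
        (e for e in elements.values() if e.link_ok and e.ident not in busy),
        None,
    )
    if idle is None:
        return None

    # Assumption: the state can be copied over some path other than the failed link.
    idle.state = dict(disabled.state)

    # Only the tasks not yet completed in this cycle are moved.
    remaining = [t for t in busy.pop(failed_id, []) if t not in completed]
    busy[idle.ident] = remaining
    return idle.ident


# Four elements; element 3 has nothing scheduled in cycle 0.
elements = {i: Element(i) for i in range(4)}
elements[1].state = {"partial_sum": 42}
schedule = {0: {0: ["a0"], 1: ["b0", "b1", "b2"], 2: ["c0"]}}

took_over = fail_over(elements, schedule, cycle=0, failed_id=1, completed={"b0"})
```

In this example element 1 loses its link after finishing `b0`, so element 3 receives a copy of its state and the remaining tasks `b1` and `b2`. The sketch returns `None` when no idle element exists; how the method handles that case is not covered by the abstract.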
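Publication number: GB2419696(B) Publication date: 2008.07.16
Application number: GB20040024125 Filing date: 2004.10.29
Applicant: HEWLETT-PACKARD DEVELOPMENT COMPANY, L.P. Inventors: CHRISTOPHER TOFTS; RICHARD TAYLOR; JOHN WILLIAM LUMLEY
Classification: G06F11/20 Main classification: G06F11/20
Agency: Agent:
Main claim:
Address: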