发明名称 DYNAMICALLY REROUTING NODE TRAFFIC ON A MASSIVELY PARALLEL COMPUTER SYSTEM USING HINT BITS
摘要 A method and apparatus for dynamically rerouting node processes on the compute nodes of a massively parallel computer system using hint bits to route around failed nodes or congested networks without restarting applications executing on the system. When a node has a failure or there are indications that it may fail, the application software on the system is suspended while the data on the failed node is moved to a backup node. The torus network traffic is routed around the failed node and traffic for the failed node is rerouted to the backup node. The application can then resume operation without restarting from the beginning.
申请公布号 US2008263386(A1) 申请公布日期 2008.10.23
申请号 US20070736811 申请日期 2007.04.18
申请人 DARRINGTON DAVID L;MCCARTHY PATRICK JOSEPH;PETERS AMANDA;SIDELNIK ALBERT;SMITH BRIAN EDWARD;SWARTZ BRENT ALLEN 发明人 DARRINGTON DAVID L.;MCCARTHY PATRICK JOSEPH;PETERS AMANDA;SIDELNIK ALBERT;SMITH BRIAN EDWARD;SWARTZ BRENT ALLEN
分类号 G06F11/00 主分类号 G06F11/00
代理机构 代理人
主权项
地址