发明名称 Performing An Allreduce Operation On A Plurality Of Compute Nodes Of A Parallel Computer
摘要 Methods, apparatus, and products are disclosed for performing an allreduce operation on a plurality of compute nodes of a parallel computer, each node including at least two processing cores, that include: performing, for each node, a local reduction operation using allreduce contribution data for the cores of that node, yielding, for each node, a local reduction result for one or more representative cores for that node; establishing one or more logical rings among the nodes, each logical ring including only one of the representative cores from each node; performing, for each logical ring, a global allreduce operation using the local reduction result for the representative cores included in that logical ring, yielding a global allreduce result for each representative core included in that logical ring; and performing, for each node, a local broadcast operation using the global allreduce results for each representative core on that node.
申请公布号 US2009292905(A1) 申请公布日期 2009.11.26
申请号 US20080124763 申请日期 2008.05.21
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 FARAJ AHMAD
分类号 G06F9/30 主分类号 G06F9/30
代理机构 代理人
主权项
地址