Invention title: Performing an allreduce operation on a plurality of compute nodes of a parallel computer
Abstract: Methods, apparatus, and products are disclosed for performing an allreduce operation on a plurality of compute nodes of a parallel computer. Each compute node includes at least two processing cores. Each processing core has contribution data for the allreduce operation. Performing an allreduce operation on a plurality of compute nodes of a parallel computer includes: establishing one or more logical rings among the compute nodes, each logical ring including at least one processing core from each compute node; performing, for each logical ring, a global allreduce operation using the contribution data for the processing cores included in that logical ring, yielding a global allreduce result for each processing core included in that logical ring; and performing, for each compute node, a local allreduce operation using the global allreduce results for each processing core on that compute node.
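The three steps named in the abstract (establish logical rings, global allreduce per ring, local allreduce per node) can be sketched as a single-process simulation. This is only an illustration under stated assumptions, not the patented implementation: the reduction operator is assumed to be a sum, every node is assumed to have the same number of cores, ring r is assumed to link core r of every node, and the function names `establish_rings`, `allreduce_sum`, and `two_level_allreduce` are hypothetical. No real network communication is modeled.

```python
from typing import List, Tuple

Core = Tuple[int, int]  # (node index, core index); hypothetical addressing scheme


def establish_rings(num_nodes: int, cores_per_node: int) -> List[List[Core]]:
    """Assumed ring layout: ring r contains core r from every compute node."""
    return [[(node, r) for node in range(num_nodes)]
            for r in range(cores_per_node)]


def allreduce_sum(values: List[int]) -> List[int]:
    """Simulated allreduce: every participant ends up with the sum of all values."""
    total = sum(values)
    return [total] * len(values)


def two_level_allreduce(contrib: List[List[int]]) -> List[List[int]]:
    """contrib[node][core] is that core's contribution; returns the final value per core."""
    num_nodes = len(contrib)
    cores_per_node = len(contrib[0])

    # Step 1: establish one logical ring per core index.
    rings = establish_rings(num_nodes, cores_per_node)

    # Step 2: global allreduce within each ring, across compute nodes.
    global_result = [[0] * cores_per_node for _ in range(num_nodes)]
    for ring in rings:
        reduced = allreduce_sum([contrib[node][core] for node, core in ring])
        for (node, core), value in zip(ring, reduced):
            global_result[node][core] = value

    # Step 3: local allreduce on each node over its cores' global results.
    return [allreduce_sum(global_result[node]) for node in range(num_nodes)]


if __name__ == "__main__":
    # Three nodes with two cores each; the overall sum is 21.
    print(two_level_allreduce([[1, 2], [3, 4], [5, 6]]))
```

In this sketch ring 0 reduces 1 + 3 + 5 = 9 and ring 1 reduces 2 + 4 + 6 = 12; the local step then combines 9 + 12 on every node, so each core holds the full sum.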
Publication number: US8161268(B2)  Publication date: 2012.04.17
Application number: US20080124756  Filing date: 2008.05.21
Applicant: FARAJ AHMAD; INTERNATIONAL BUSINESS MACHINES CORPORATION  Inventor: FARAJ AHMAD
Classification: G06F15/80; G06F9/30; G06F9/302  Main classification: G06F15/80
Agency:  Agent:
Principal claim:
Address: