Title of Invention: Performing A Local Reduction Operation On A Parallel Computer
Abstract: A parallel computer including compute nodes, each including two reduction processing cores, a network write processing core, and a network read processing core, each processing core assigned an input buffer. Copying, in interleaved chunks by the reduction processing cores, contents of the reduction processing cores' input buffers to an interleaved buffer in shared memory; copying, by one of the reduction processing cores, contents of the network write processing core's input buffer to shared memory; copying, by another of the reduction processing cores, contents of the network read processing core's input buffer to shared memory; and locally reducing in parallel by the reduction processing cores: the contents of the reduction processing core's input buffer; every other interleaved chunk of the interleaved buffer; the copied contents of the network write processing core's input buffer; and the copied contents of the network read processing core's input buffer.
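The three copy-and-reduce steps in the abstract can be sketched as follows. This is a minimal, sequential Python simulation, not the patented implementation: the chunk size, the function names (`chunks_for`, `local_reduce`), the use of addition as the reduction operator, and the exact chunk layout (core 0 writes even-numbered chunks, core 1 writes odd-numbered chunks, and each core then reduces the chunks written by the other) are all assumptions chosen for illustration. On real hardware the two reduction cores would run these loops in parallel.

```python
from operator import add

CHUNK = 2  # hypothetical chunk size, in elements


def chunks_for(core, n, chunk=CHUNK):
    """Yield (start, stop) for every other chunk: even chunks for core 0, odd for core 1."""
    for k, start in enumerate(range(0, n, chunk)):
        if k % 2 == core:
            yield start, min(start + chunk, n)


def local_reduce(red0, red1, net_write, net_read, op=add):
    """Simulate the local reduction of four equal-length input buffers (assumed layout)."""
    n = len(red0)
    own = [red0, red1]  # input buffers of the two reduction cores

    # Step 1: each reduction core copies its own chunks into the
    # interleaved buffer in shared memory (assumed: core 0 even, core 1 odd).
    interleaved = [None] * n
    for core in (0, 1):
        for s, e in chunks_for(core, n):
            interleaved[s:e] = own[core][s:e]

    # Step 2: one reduction core copies the network write core's input
    # buffer to shared memory, the other copies the network read core's.
    shared_write = list(net_write)  # copied by core 0 (assumption)
    shared_read = list(net_read)    # copied by core 1 (assumption)

    # Step 3: each reduction core reduces, for the chunks written by the
    # OTHER core: its own input, the interleaved chunk, and both copies.
    result = [None] * n
    for core in (0, 1):
        for s, e in chunks_for(1 - core, n):
            for i in range(s, e):
                partial = op(own[core][i], interleaved[i])
                result[i] = op(op(partial, shared_write[i]), shared_read[i])
    return result
```

With this layout, every element of the result combines all four input buffers exactly once, and each reduction core touches only half of the chunks, which is the point of interleaving the work between the two cores.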
Publication No.: US2012317399(A1) Publication Date: 2012.12.13
Application No.: US201213585993 Filing Date: 2012.08.15
Applicant: BLOCKSOME MICHAEL A.;FARAJ DANIEL A.;INTERNATIONAL BUSINESS MACHINES CORPORATION Inventor: BLOCKSOME MICHAEL A.;FARAJ DANIEL A.
Classification: G06F15/76;G06F9/02;G06F12/00;G06F15/16 Main Classification: G06F15/76
Agency: Agent:
Main Claim:
Address: