发明名称 Optimized Corner Turns for Local Storage and Bandwidth Reduction
摘要 A block matrix multiplication mechanism is provided for reversing the visitation order of blocks at corner turns when performing a block matrix multiplication operation in a data processing system. By reversing the visitation order, the mechanism eliminates a block load at the corner turns. In accordance with the illustrative embodiment, a corner return is referred to as a“bounce”corner turn and results in a serpentine patterned processing order of the matrix blocks. The mechanism allows the data processing system to perform a block matrix multiplication operation with a maximum of three block transfers per time step. Therefore, the mechanism reduces maximum throughput and increases performance. In addition, the mechanism also reduces the number of multi-buffered local store buffers.
申请公布号 US2012203816(A1) 申请公布日期 2012.08.09
申请号 US201213451967 申请日期 2012.04.20
申请人 BROKENSHIRE DANIEL A.;GUNNELS JOHN A.;KISTLER MICHAEL D.;INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 BROKENSHIRE DANIEL A.;GUNNELS JOHN A.;KISTLER MICHAEL D.
分类号 G06F7/523 主分类号 G06F7/523
代理机构 代理人
主权项
地址