发明名称 Matrix transposition in a computer system
摘要 Improved transposition of a matrix in a computer system may be accomplished while utilizing at most a single permutation vector. This greatly improves the speed and parallelability of the transpose operation. For a standard rectangular matrix having M rows and N columns and a size MxN, first n and q are determined, wherein N=n*q, and wherein Mxq represents a block size and wherein N is evenly divisible by p. Then, the matrix is partitioned into n columns of size Mxq. Then for each column n, elements are sequentially read within the column row-wise and sequentially written into a cache, then sequentially read from the cache and sequentially written row-wise back into the matrix in a memory in a column of size qxM. A permutation vector may then be applied to the matrix to arrive at the transpose. This method may be modified for special cases, such as square matrices, to further improve efficiency.
申请公布号 US7031994(B2) 申请公布日期 2006.04.18
申请号 US20020218312 申请日期 2002.08.13
申请人 SUN MICROSYSTEMS, INC. 发明人 LAO SHANDONG;LEWIS BRADLEY ROMAIN;BOUCHER MICHAEL LEE
分类号 G06F7/00;G06F7/78;G06F17/14 主分类号 G06F7/00
代理机构 代理人
主权项
地址