Title of invention EFFICIENT SPARSE MATRIX-VECTOR MULTIPLICATION ON PARALLEL PROCESSORS
Abstract A method of multiplying a sparse matrix and a vector to obtain a new vector, and a system for implementing the method, are claimed. Embodiments of the method are intended to optimize the performance of sparse matrix-vector multiplication on highly parallel processors, such as GPUs. The sparse matrix is stored in compressed sparse row (CSR) format.
Publication number US2016140084(A1) Publication date 2016.05.19
Application number US201414542003 Filing date 2014.11.14
Applicant ADVANCED MICRO DEVICES, INC. Inventors Daga Mayank; Greathouse Joseph L.
Classification G06F17/16 Main classification G06F17/16
Agency Agent
Principal claim 1. A method of multiplication of a sparse matrix and a vector to obtain a new vector, the sparse matrix being stored in compressed sparse row (CSR) format, the method to be executed on a parallel processor, the method comprising: partitioning the matrix into blocks of consecutive rows; and for each of said blocks having more than a minimum number of rows: determining a number of non-zero matrix elements in the block; executing a first process for sparse-matrix vector multiplication on rows of the block if the number of non-zero matrix elements in the block is less than or equal to a maximum; and executing a second process for sparse-matrix vector multiplication, distinct from the first process, if the number of non-zero matrix elements is greater than the maximum.
Address Sunnyvale CA US
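
The control flow recited in claim 1 can be illustrated with a minimal, sequential Python sketch. It is not the patented implementation and does not run on a parallel processor. The record does not describe what the two processes are, how blocks are sized, or how blocks at or below the minimum row count are handled, so everything below is an assumption made for illustration: the function names, the fixed `rows_per_block` partitioning, the `min_rows` and `max_nnz` parameters, the two stand-in kernels (a buffered reduce for the first process, a row-by-row loop for the second), and the fallback for small blocks. The CSR arrays follow the usual convention: `row_ptr[r]` to `row_ptr[r + 1]` delimits the non-zeros of row `r` in `col_idx` and `vals`.

```python
def partition_rows(row_ptr, rows_per_block):
    """Split the matrix into blocks of consecutive rows as half-open (start, end) pairs.
    Fixed-size blocks are an assumption; the claim only requires consecutive rows."""
    n_rows = len(row_ptr) - 1
    return [(s, min(s + rows_per_block, n_rows))
            for s in range(0, n_rows, rows_per_block)]


def first_process(row_ptr, col_idx, vals, x, y, start, end):
    """Hypothetical 'first process': stage every product of the block in one
    buffer, then reduce the buffer row by row."""
    lo, hi = row_ptr[start], row_ptr[end]
    products = [vals[k] * x[col_idx[k]] for k in range(lo, hi)]
    for r in range(start, end):
        y[r] = sum(products[row_ptr[r] - lo:row_ptr[r + 1] - lo])


def second_process(row_ptr, col_idx, vals, x, y, start, end):
    """Hypothetical 'second process': accumulate each row directly, no staging buffer."""
    for r in range(start, end):
        acc = 0.0
        for k in range(row_ptr[r], row_ptr[r + 1]):
            acc += vals[k] * x[col_idx[k]]
        y[r] = acc


def spmv_blocked(row_ptr, col_idx, vals, x, rows_per_block=2, min_rows=1, max_nnz=4):
    """Compute y = A @ x for a CSR matrix, choosing a process per block.
    Returns (y, chosen), where chosen records which path each block took."""
    y = [0.0] * (len(row_ptr) - 1)
    chosen = []
    for start, end in partition_rows(row_ptr, rows_per_block):
        if end - start > min_rows:
            # Number of non-zeros in the block, read directly from the row pointers.
            nnz = row_ptr[end] - row_ptr[start]
            if nnz <= max_nnz:
                first_process(row_ptr, col_idx, vals, x, y, start, end)
                chosen.append("first")
            else:
                second_process(row_ptr, col_idx, vals, x, y, start, end)
                chosen.append("second")
        else:
            # Not covered by claim 1; this fallback is an assumption of the sketch.
            second_process(row_ptr, col_idx, vals, x, y, start, end)
            chosen.append("fallback")
    return y, chosen


if __name__ == "__main__":
    # 5x4 matrix with rows [1,0,2,0], [0,3,0,0], [4,5,6,7], [0,0,8,9], [0,0,0,10]
    row_ptr = [0, 2, 3, 7, 9, 10]
    col_idx = [0, 2, 1, 0, 1, 2, 3, 2, 3, 3]
    vals = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0, 9.0, 10.0]
    x = [1.0, 1.0, 1.0, 1.0]
    print(spmv_blocked(row_ptr, col_idx, vals, x))
    # ([3.0, 3.0, 22.0, 17.0, 10.0], ['first', 'second', 'fallback'])
```

With the default parameters, the first block (rows 0 to 1, 3 non-zeros) takes the first process, the second block (rows 2 to 3, 6 non-zeros) exceeds `max_nnz` and takes the second process, and the final single-row block falls outside the claimed condition. Because `row_ptr` already holds running non-zero counts, the per-block count costs one subtraction, which is one reason CSR suits this kind of dispatch.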