Structural Agnostic SpMV: Adapting CSR-Adaptive for Irregular Matrices

被引:25
作者
Daga, Mayank [1 ]
Greathouse, Joseph L. [1 ]
机构
[1] Adv Micro Devices Inc, AMD Res, Sunnyvale, CA 94088 USA
来源
2015 IEEE 22ND INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC) | 2015年
关键词
D O I
10.1109/HiPC.2015.55
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Sparse matrix vector multiplication (SpMV) is an important linear algebra primitive. Recent research has focused on improving the performance of SpMV on GPUs when using compressed sparse row (CSR), the most frequently used matrix storage format on CPUs. Efficient CSR-based SpMV obviates the need for other GPU-specific storage formats, thereby saving runtime and storage overheads. However, existing CSR-based SpMV algorithms on GPUs perform poorly on irregular sparse matrices, limiting their usefulness. We propose a novel approach for SpMV on GPUs which works well for both regular and irregular matrices while keeping the CSR format intact. We start with CSR-Adaptive, which dynamically chooses between two SpMV algorithms depending on the length of each row. We then add a series of performance improvements, such as a more efficient reduction technique. Finally, we add a third algorithm which uses multiple parallel execution units when operating on irregular matrices with very long rows. Our implementation dynamically assigns the best algorithm to sets of rows in order to ensure that the GPU is efficiently utilized. We effectively double the performance of CSR-Adaptive, which had previously demonstrated better performance than algorithms that use other storage formats. In addition, our implementation is 36% faster than CSR5, the current state of the art for SpMV on GPUs.
引用
收藏
页码:64 / 74
页数:11
相关论文
共 23 条
  • [1] [Anonymous], 2003, CITESEERX
  • [2] [Anonymous], INT WORKSH GPUS SCI
  • [3] [Anonymous], P INT C HIGH PERF EM
  • [4] Ashari A, 2014, P INT C HIGH PERF CO
  • [5] Bell N, 2009, PROCEEDINGS OF THE CONFERENCE ON HIGH PERFORMANCE COMPUTING NETWORKING, STORAGE AND ANALYSIS
  • [6] Choi J. W., 2010, P S PRINC PRACT PAR
  • [7] Danalis A., 2010, P WORKSH GEN PURP CO
  • [8] An overview of the Sparse Basic Linear Algebra Subprograms: The new standard from the BLAS Technical Forum
    Duff, IS
    Heroux, MA
    Pozo, R
    [J]. ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2002, 28 (02): : 239 - 267
  • [9] Greathouse J.L., 2014, P INT C HIGH PERF CO
  • [10] Gropp W. D., 1999, P INT PAR COMP FLUID