SpDRAM: Efficient In-DRAM Acceleration of Sparse Matrix-Vector Multiplication

被引:0
作者
Kang, Jieui [1 ]
Choi, Soeun [1 ]
Lee, Eunjin [1 ]
Sim, Jaehyeong [2 ]
机构
[1] Ewha Womans Univ, Artificial Intelligence Convergence, Seoul 03760, South Korea
[2] Ewha Womans Univ, Dept Comp Sci & Engn, Seoul 03760, South Korea
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Random access memory; Sparse matrices; Computer architecture; Logic; Vectors; Turning; System-on-chip; Space exploration; Sorting; SRAM cells; Processing-in-memory; SpMV; sparsity; DRAM; ARCHITECTURE;
D O I
10.1109/ACCESS.2024.3505622
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We introduce novel sparsity-aware in-DRAM matrix mapping techniques and a correspondingDRAM-based acceleration framework, termedSpDRAM, which utilizes a triple row activation schemeto efficiently handle sparse matrix-vector multiplication (SpMV). We found that reducing operationsby sparsity relies heavily on how matrices are mapped into DRAM banks, which operate row byrow. These banks operate row by row. From this insight, we developed two distinct matrix mappingtechniques aimed at maximizing the reduction of row operations with minimal design overhead: Output-aware Matrix Permutation (OMP) and Zero-aware Matrix Column Sorting (ZMCS). Additionally,we propose a Multiplication Deferring (MD) scheme that leverages the prevalent bit-level sparsity inmatrix values to decrease the effective bit-width required for in-bank multiplication operations. Evaluationresults demonstrate that the combination of our in-DRAM acceleration methods outperforms the latestDRAM-based PIM accelerator for SpMV, achieving a performance increase of up to 7.54xand a 22.4ximprovement in energy efficiency in a wide range of SpMV tasks
引用
收藏
页码:176009 / 176021
页数:13
相关论文
共 50 条
  • [31] SIMD Parallel Sparse Matrix-Vector and Transposed-Matrix-Vector Multiplication in DD Precision
    Hishinuma, Toshiaki
    Hasegawa, Hidehiko
    Tanaka, Teruo
    HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2016, 2017, 10150 : 21 - 34
  • [32] Joint direct and transposed sparse matrix-vector multiplication for multithreaded CPUs
    Kozicky, Claudio
    Simecek, Ivan
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (13)
  • [33] Sparse Matrix-Vector Multiplication Cache Performance Evaluation and Design Exploration
    Cui, Jianfeng
    Lu, Kai
    Liu, Sheng
    29TH INTERNATIONAL SYMPOSIUM ON THE MODELING, ANALYSIS, AND SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS 2021), 2021, : 97 - 103
  • [34] CoAdELL: Adaptivity and Compression for Improving Sparse Matrix-Vector Multiplication on GPUs
    Maggioni, Marco
    Berger-Wolf, Tanya
    PROCEEDINGS OF 2014 IEEE INTERNATIONAL PARALLEL & DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2014, : 934 - 941
  • [35] Breaking the performance bottleneck of sparse matrix-vector multiplication on SIMD processors
    Zhang, Kai
    Chen, Shuming
    Wang, Yaohua
    Wan, Jianghua
    IEICE ELECTRONICS EXPRESS, 2013, 10 (09):
  • [36] Speculative segmented sum for sparse matrix-vector multiplication on heterogeneous processors
    Liu, Weifeng
    Vinter, Brian
    PARALLEL COMPUTING, 2015, 49 : 179 - 193
  • [37] A High Memory Bandwidth FPGA Accelerator for Sparse Matrix-Vector Multiplication
    Fowers, Jeremy
    Ovtcharov, Kalin
    Strauss, Karin
    Chung, Eric S.
    Stitt, Greg
    2014 IEEE 22ND ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM 2014), 2014, : 36 - 43
  • [38] Bitmap-Based Sparse Matrix-Vector Multiplication with Tensor Cores
    Chen, YuAng
    Yu, Jeffery Xu
    53RD INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2024, 2024, : 1135 - 1144
  • [39] Automatic Tuning of Sparse Matrix-Vector Multiplication for CRS format on GPUs
    Yoshizawa, Hiroki
    Takahashi, Daisuke
    15TH IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE 2012) / 10TH IEEE/IFIP INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (EUC 2012), 2012, : 130 - 136
  • [40] Auto-tuning of Sparse Matrix-Vector Multiplication on Graphics Processors
    Abu-Sufah, Walid
    Karim, Asma Abdel
    SUPERCOMPUTING (ISC 2013), 2013, 7905 : 151 - 164