SpDRAM: Efficient In-DRAM Acceleration of Sparse Matrix-Vector Multiplication

被引:0
作者
Kang, Jieui [1 ]
Choi, Soeun [1 ]
Lee, Eunjin [1 ]
Sim, Jaehyeong [2 ]
机构
[1] Ewha Womans Univ, Artificial Intelligence Convergence, Seoul 03760, South Korea
[2] Ewha Womans Univ, Dept Comp Sci & Engn, Seoul 03760, South Korea
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Random access memory; Sparse matrices; Computer architecture; Logic; Vectors; Turning; System-on-chip; Space exploration; Sorting; SRAM cells; Processing-in-memory; SpMV; sparsity; DRAM; ARCHITECTURE;
D O I
10.1109/ACCESS.2024.3505622
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We introduce novel sparsity-aware in-DRAM matrix mapping techniques and a correspondingDRAM-based acceleration framework, termedSpDRAM, which utilizes a triple row activation schemeto efficiently handle sparse matrix-vector multiplication (SpMV). We found that reducing operationsby sparsity relies heavily on how matrices are mapped into DRAM banks, which operate row byrow. These banks operate row by row. From this insight, we developed two distinct matrix mappingtechniques aimed at maximizing the reduction of row operations with minimal design overhead: Output-aware Matrix Permutation (OMP) and Zero-aware Matrix Column Sorting (ZMCS). Additionally,we propose a Multiplication Deferring (MD) scheme that leverages the prevalent bit-level sparsity inmatrix values to decrease the effective bit-width required for in-bank multiplication operations. Evaluationresults demonstrate that the combination of our in-DRAM acceleration methods outperforms the latestDRAM-based PIM accelerator for SpMV, achieving a performance increase of up to 7.54xand a 22.4ximprovement in energy efficiency in a wide range of SpMV tasks
引用
收藏
页码:176009 / 176021
页数:13
相关论文
共 50 条
  • [41] Performance Analysis and Optimization of Sparse Matrix-Vector Multiplication on Intel Xeon Phi
    Elafrou, Athena
    Goumas, Georgios
    Koziris, Nectarios
    2017 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2017, : 1389 - 1398
  • [42] Optimized Data Reuse via Reordering for Sparse Matrix-Vector Multiplication on FPGAs
    Li, Shiqing
    Liu, Di
    Liu, Weichen
    2021 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN (ICCAD), 2021,
  • [43] A Fully Structure-Driven Performance Analysis of Sparse Matrix-Vector Multiplication
    Sandhu, Prabhjot
    Verbrugge, Clark
    Hendren, Laurie
    PROCEEDINGS OF THE ACM/SPEC INTERNATIONAL CONFERENCE ON PERFORMANCE ENGINEERING (ICPE'20), 2020, : 108 - 119
  • [44] A Storage Optimization Scheme for PITD Method Using Sparse Matrix-Vector Multiplication
    Ma, Liang
    Ma, Xikui
    Chi, Mingjun
    Xiang, Ru
    Zhu, Xiaojie
    IEEE MICROWAVE AND WIRELESS TECHNOLOGY LETTERS, 2025, 35 (02): : 145 - 148
  • [45] A Streaming Dataflow Engine for Sparse Matrix-Vector Multiplication Using High-Level Synthesis
    Hosseinabady, Mohammad
    Nunez-Yanez, Jose Luis
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2020, 39 (06) : 1272 - 1285
  • [46] An Effective Approach for Implementing Sparse Matrix-Vector Multiplication on Graphics Processing Units
    Abu-Sufah, Walid
    Karim, Asma Abdel
    2012 IEEE 14TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2012 IEEE 9TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (HPCC-ICESS), 2012, : 453 - 460
  • [47] SMAT: An Input Adaptive Auto-Tuner for Sparse Matrix-Vector Multiplication
    Li, Jiajia
    Tan, Guangming
    Chen, Mingyu
    Sun, Ninghui
    ACM SIGPLAN NOTICES, 2013, 48 (06) : 117 - 126
  • [48] Multi-Mode Transprecision Sparse Matrix-Vector Multiplication Engine for PageRank
    Kim, Whijin
    Lee, Jihye
    Kim, Sujin
    Kim, Ji-Hoon
    2022 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2022,
  • [49] Sparse Matrix-Vector Multiplication Optimizations based on Matrix Bandwidth Reduction using NVIDIA CUDA
    Xu, Shiming
    Lin, Hai Xiang
    Xue, Wei
    PROCEEDINGS OF THE NINTH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS TO BUSINESS, ENGINEERING AND SCIENCE (DCABES 2010), 2010, : 609 - 614
  • [50] Toward Energy-Efficient Sparse Matrix-Vector Multiplication with Near STT-MRAM Computing Architecture
    Yueting Li
    He Zhang
    Xueyan Wang
    Hao Cai
    Yundong Zhang
    Shuqin Lv
    Renguang Liu
    Weisheng Zhao
    2023 28TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC, 2023, : 222 - 227