SpDRAM: Efficient In-DRAM Acceleration of Sparse Matrix-Vector Multiplication

被引：0

作者：

Kang, Jieui ^{[1
]}

Choi, Soeun ^{[1
]}

Lee, Eunjin ^{[1
]}

Sim, Jaehyeong ^{[2
]}

机构：

[1] Ewha Womans Univ, Artificial Intelligence Convergence, Seoul 03760, South Korea

[2] Ewha Womans Univ, Dept Comp Sci & Engn, Seoul 03760, South Korea

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Random access memory; Sparse matrices; Computer architecture; Logic; Vectors; Turning; System-on-chip; Space exploration; Sorting; SRAM cells; Processing-in-memory; SpMV; sparsity; DRAM; ARCHITECTURE;

D O I：

10.1109/ACCESS.2024.3505622

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We introduce novel sparsity-aware in-DRAM matrix mapping techniques and a correspondingDRAM-based acceleration framework, termedSpDRAM, which utilizes a triple row activation schemeto efficiently handle sparse matrix-vector multiplication (SpMV). We found that reducing operationsby sparsity relies heavily on how matrices are mapped into DRAM banks, which operate row byrow. These banks operate row by row. From this insight, we developed two distinct matrix mappingtechniques aimed at maximizing the reduction of row operations with minimal design overhead: Output-aware Matrix Permutation (OMP) and Zero-aware Matrix Column Sorting (ZMCS). Additionally,we propose a Multiplication Deferring (MD) scheme that leverages the prevalent bit-level sparsity inmatrix values to decrease the effective bit-width required for in-bank multiplication operations. Evaluationresults demonstrate that the combination of our in-DRAM acceleration methods outperforms the latestDRAM-based PIM accelerator for SpMV, achieving a performance increase of up to 7.54xand a 22.4ximprovement in energy efficiency in a wide range of SpMV tasks

引用

页码：176009 / 176021

页数：13

共 50 条

[31] SIMD Parallel Sparse Matrix-Vector and Transposed-Matrix-Vector Multiplication in DD Precision
Hishinuma, Toshiaki
Hasegawa, Hidehiko
Tanaka, Teruo
HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2016, 2017, 10150 : 21 - 34
[32] Joint direct and transposed sparse matrix-vector multiplication for multithreaded CPUs
Kozicky, Claudio
Simecek, Ivan
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (13)
[33] Sparse Matrix-Vector Multiplication Cache Performance Evaluation and Design Exploration
Cui, Jianfeng
Lu, Kai
Liu, Sheng
29TH INTERNATIONAL SYMPOSIUM ON THE MODELING, ANALYSIS, AND SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS 2021), 2021, : 97 - 103
[34] CoAdELL: Adaptivity and Compression for Improving Sparse Matrix-Vector Multiplication on GPUs
Maggioni, Marco
Berger-Wolf, Tanya
PROCEEDINGS OF 2014 IEEE INTERNATIONAL PARALLEL & DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2014, : 934 - 941
[35] Breaking the performance bottleneck of sparse matrix-vector multiplication on SIMD processors
Zhang, Kai
Chen, Shuming
Wang, Yaohua
Wan, Jianghua
IEICE ELECTRONICS EXPRESS, 2013, 10 (09):
[36] Speculative segmented sum for sparse matrix-vector multiplication on heterogeneous processors
Liu, Weifeng
Vinter, Brian
PARALLEL COMPUTING, 2015, 49 : 179 - 193
[37] A High Memory Bandwidth FPGA Accelerator for Sparse Matrix-Vector Multiplication
Fowers, Jeremy
Ovtcharov, Kalin
Strauss, Karin
Chung, Eric S.
Stitt, Greg
2014 IEEE 22ND ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM 2014), 2014, : 36 - 43
[38] Bitmap-Based Sparse Matrix-Vector Multiplication with Tensor Cores
Chen, YuAng
Yu, Jeffery Xu
53RD INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2024, 2024, : 1135 - 1144
[39] Automatic Tuning of Sparse Matrix-Vector Multiplication for CRS format on GPUs
Yoshizawa, Hiroki
Takahashi, Daisuke
15TH IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE 2012) / 10TH IEEE/IFIP INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (EUC 2012), 2012, : 130 - 136
[40] Auto-tuning of Sparse Matrix-Vector Multiplication on Graphics Processors
Abu-Sufah, Walid
Karim, Asma Abdel
SUPERCOMPUTING (ISC 2013), 2013, 7905 : 151 - 164

← 1 2 3 4 5 →