Sparse Matrix-Vector Multiplication Cache Performance Evaluation and Design Exploration

Cited by: 0
Authors
Cui, Jianfeng [1 ]
Lu, Kai [1 ]
Liu, Sheng [2 ]
Affiliations
[1] Natl Univ Def Technol, Sch Comp, Changsha 410073, Hunan, Peoples R China
[2] Natl Univ Def Technol, Sci & Technol Parallel & Distributed Proc Lab, Changsha 410073, Hunan, Peoples R China
Source
29TH INTERNATIONAL SYMPOSIUM ON THE MODELING, ANALYSIS, AND SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS 2021) | 2021
Keywords
SpMV; cache; sparse matrix; PIN; simulation
DOI
10.1109/MASCOTS53633.2021.9614301
CLC number
TP [Automation technology, computer technology]
Discipline code
0812
Abstract
In this paper, we conducted a set of evaluations of a sequential SpMV kernel implementation to investigate its cache performance on single-core platforms. Across a suite of sparse matrices covering various domains, we verified a similar access pattern that makes the cache hit rate remarkably high in a sequential environment. This implicit regularity led us to propose a cache-space-splitting approach, aiming at better locality in dense-vector accesses and better utilization of the large cache capacity of modern processors. Finally, based on our experimental results, we explored the cache design space of the Matrix 3000 GPDSP and proposed a set of cache parameters.
Pages: 97-103
Number of pages: 7
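
As a companion to the abstract above, the following is a minimal sketch of a sequential CSR-based SpMV kernel of the kind the paper evaluates; the CSR layout, array names, and function signature are illustrative assumptions, not taken from the authors' code. The streaming accesses to values/col_idx contrast with the irregular, column-index-driven accesses to the dense vector x, which is the locality problem the proposed cache-space splitting targets.

```c
#include <stddef.h>

/* Sketch of a sequential CSR SpMV kernel, y = A * x (assumed layout;
 * not the authors' implementation). */
void spmv_csr(size_t n_rows,
              const size_t *row_ptr,   /* n_rows + 1 entries           */
              const size_t *col_idx,   /* column index of each nonzero */
              const double *values,    /* nonzero values of A          */
              const double *x,         /* dense input vector           */
              double *y)               /* dense output vector          */
{
    for (size_t i = 0; i < n_rows; i++) {
        double sum = 0.0;
        /* Nonzeros of row i are stored contiguously, so values[] and
         * col_idx[] stream through the cache; x[] is accessed through
         * col_idx[] and is the irregular part of the access pattern. */
        for (size_t k = row_ptr[i]; k < row_ptr[i + 1]; k++)
            sum += values[k] * x[col_idx[k]];
        y[i] = sum;
    }
}
```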