Sparse Matrix-Vector Multiplication Cache Performance Evaluation and Design Exploration

Cited: 0
Authors
Cui, Jianfeng [1 ]
Lu, Kai [1 ]
Liu, Sheng [2 ]
Institutions
[1] Natl Univ Def Technol, Sch Comp, Changsha 410073, Hunan, Peoples R China
[2] Natl Univ Def Technol, Sci & Technol Parallel & Distributed Proc Lab, Changsha 410073, Hunan, Peoples R China
Source
29TH INTERNATIONAL SYMPOSIUM ON THE MODELING, ANALYSIS, AND SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS 2021) | 2021
Keywords
SpMV; cache; sparse matrix; PIN; simulation
DOI
10.1109/MASCOTS53633.2021.9614301
CLC Classification Number
TP [Automation Technology, Computer Technology]
Subject Classification Code
0812
Abstract
In this paper, we conducted a group of evaluations on a sequential implementation of the SpMV kernel to investigate cache performance on single-core platforms. Across a suite of sparse matrices covering various domains, we verified a similar access pattern, which makes the cache hit rate remarkably high in a sequential environment. This implicit regularity drove us to propose a cache space splitting approach, aiming at better locality in dense-vector accesses and better utilization of the large cache capacity of modern processors. Finally, based on our experimental results, we explored the cache design space of the Matrix 3000 GPDSP and proposed a group of cache parameters.
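The sequential SpMV kernel the abstract studies can be illustrated with a minimal CSR (compressed sparse row) sketch. This is not the paper's code; the function name and sample matrix are illustrative. It highlights the indirect access `x[col_idx[k]]` into the dense vector, whose locality is what a cache-splitting scheme would target.

```python
def spmv_csr(row_ptr, col_idx, vals, x):
    """Sequential sparse matrix-vector product y = A @ x, with A in CSR form."""
    n_rows = len(row_ptr) - 1
    y = [0.0] * n_rows
    for i in range(n_rows):
        acc = 0.0
        # Nonzeros of row i occupy positions row_ptr[i]..row_ptr[i+1]-1.
        for k in range(row_ptr[i], row_ptr[i + 1]):
            # vals and col_idx are streamed sequentially; x is accessed
            # irregularly through col_idx, the cache-critical pattern.
            acc += vals[k] * x[col_idx[k]]
        y[i] = acc
    return y

# 3x3 example matrix:
# [[1 0 2]
#  [0 3 0]
#  [4 0 5]]
row_ptr = [0, 2, 3, 5]
col_idx = [0, 2, 1, 0, 2]
vals = [1.0, 2.0, 3.0, 4.0, 5.0]
x = [1.0, 1.0, 1.0]
print(spmv_csr(row_ptr, col_idx, vals, x))  # [3.0, 3.0, 9.0]
```

Because `row_ptr`, `col_idx`, and `vals` are read with unit stride while `x` is not, the reuse of `x` lines dominates the cache behaviour of the kernel, which is why the abstract focuses on dense-vector locality.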
Pages: 97-103
Number of pages: 7
Related Papers
50 records in total
  • [41] Sparse Matrix-Vector Multiplication Optimizations based on Matrix Bandwidth Reduction using NVIDIA CUDA
    Xu, Shiming
    Lin, Hai Xiang
    Xue, Wei
    PROCEEDINGS OF THE NINTH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS TO BUSINESS, ENGINEERING AND SCIENCE (DCABES 2010), 2010, : 609 - 614
  • [42] Performance Analysis and Optimization of Sparse Matrix-Vector Multiplication on Modern Multi- and Many-Core Processors
    Elafrou, Athena
    Goumas, Georgios
    Koziris, Nectarios
    2017 46TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2017, : 292 - 301
  • [43] Optimization of Sparse Matrix-Vector Multiplication for CRS Format on NVIDIA Kepler Architecture GPUs
    Mukunoki, Daichi
    Takahashi, Daisuke
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2013, PT V, 2013, 7975 : 211 - 223
  • [44] An Efficient Two-Dimensional Blocking Strategy for Sparse Matrix-Vector Multiplication on GPUs
    Ashari, Arash
    Sedaghati, Naser
    Eisenlohr, John
    Sadayappan, P.
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, (ICS'14), 2014, : 273 - 282
  • [45] The Sliced COO format for Sparse Matrix-Vector Multiplication on CUDA-enabled GPUs
    Dang, Hoang-Vu
    Schmidt, Bertil
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2012, 2012, 9 : 57 - 66
  • [46] Exploring Better Speculation and Data Locality in Sparse Matrix-Vector Multiplication on Intel Xeon
    Zhao, Haoran
    Xia, Tian
    Li, Chenyang
    Zhao, Wenzhe
    Zheng, Nanning
    Ren, Pengju
    2020 IEEE 38TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2020), 2020, : 601 - 609
  • [47] A Method of Zero/One Matrix-Vector Multiplication
    Yu, Li
    Liu, Xia
    Chen, Ang
    Li, Yuan-Xiang
    PROCEEDINGS OF FIRST INTERNATIONAL CONFERENCE OF MODELLING AND SIMULATION, VOL III: MODELLING AND SIMULATION IN ELECTRONICS, COMPUTING, AND BIO-MEDICINE, 2008, : 95 - 99
  • [48] Efficient dense matrix-vector multiplication on GPU
    He, Guixia
    Gao, Jiaquan
    Wang, Jun
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2018, 30 (19)
  • [49] Design space exploration for sparse matrix-matrix multiplication on FPGAs
    Lin, Colin Yu
    Wong, Ngai
    So, Hayden Kwok-Hay
    INTERNATIONAL JOURNAL OF CIRCUIT THEORY AND APPLICATIONS, 2013, 41 (02) : 205 - 219
  • [50] Merge-based Sparse Matrix-Vector Multiplication (SpMV) using the CSR Storage Format
    Merrill, Duane
    Garland, Michael
    ACM SIGPLAN NOTICES, 2016, 51 (08) : 389 - 390