Heterogeneous Sparse Matrix Computations on Hybrid GPU/CPU Platforms

被引:2
|
作者
Cardellini, Valeria [1 ]
Fanfarillo, Alessandro [1 ]
Filippone, Salvatore [2 ]
机构
[1] Univ Roma Tor Vergata, Dipartimento Ingn Civile & Ingn Informat, Rome, Italy
[2] Univ Roma Tor Vergata, Dipartimento Ingn Ind, Rome, Italy
来源
PARALLEL COMPUTING: ACCELERATING COMPUTATIONAL SCIENCE AND ENGINEERING (CSE) | 2014年 / 25卷
关键词
Sparse Matrix Computations; Design Patterns; CUDA; GPGPU;
D O I
10.3233/978-1-61499-381-0-203
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Hybrid GPU/CPU clusters are becoming very popular in the scientific computing community, as attested by the number of such systems present in the Top 500 list. In this paper, we address one of the key algorithms for scientific applications: the computation of sparse matrix-vector products that lies at the heart of iterative solvers for sparse linear systems. We detail how design patterns for sparse matrix computations enable us to easily adapt to such a heterogeneous GPU/CPU platform using several sparse matrix formats in order to achieve best performance; then, we analyze static load balancing strategies for devising a suitable data decomposition and propose our approach. We discuss our experience in using different sparse matrix formats and data partitioning algorithms with a number of computational experiments executed on three different hybrid GPU/CPU platforms.
引用
收藏
页码:203 / 212
页数:10
相关论文
共 50 条
  • [1] Design patterns for sparse-matrix computations on hybrid CPU/GPU platforms
    Cardellini, Valeria
    Filippone, Salvatore
    Rouson, Damian W. I.
    SCIENTIFIC PROGRAMMING, 2014, 22 (01) : 1 - 19
  • [2] CoopCL: Cooperative Execution of OpenCL Programs on Heterogeneous CPU-GPU Platforms
    Moren, Konrad
    Goehringer, Diana
    2020 28TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING (PDP 2020), 2020, : 224 - 231
  • [3] Parallelization of large vector similarity computations in a hybrid CPU plus GPU environment
    Czarnul, Pawe
    JOURNAL OF SUPERCOMPUTING, 2018, 74 (02): : 768 - 786
  • [4] Parallelization of large vector similarity computations in a hybrid CPU+GPU environment
    Paweł Czarnul
    The Journal of Supercomputing, 2018, 74 : 768 - 786
  • [5] A collaborative CPU-GPU approach for principal component analysis on mobile heterogeneous platforms
    Valery, Olivier
    Liu, Pangfeng
    Wu, Jan-Jan
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2018, 120 : 44 - 61
  • [6] Exploiting task and data parallelism for advanced video coding on hybrid CPU + GPU platforms
    Svetislav Momcilovic
    Nuno Roma
    Leonel Sousa
    Journal of Real-Time Image Processing, 2016, 11 : 571 - 587
  • [7] Performance evaluation of hybrid programming patterns for large CPU/GPU heterogeneous clusters
    Lu, Fengshun
    Song, Junqiang
    Yin, Fukang
    Zhu, Xiaoqian
    COMPUTER PHYSICS COMMUNICATIONS, 2012, 183 (06) : 1172 - 1181
  • [8] Boosting CUDA Applications with CPU–GPU Hybrid Computing
    Changmin Lee
    Won Woo Ro
    Jean-Luc Gaudiot
    International Journal of Parallel Programming, 2014, 42 : 384 - 404
  • [9] Parallelization with load balancing of the weather scheme WSM7 for heterogeneous CPU-GPU platforms
    Jakobs, Thomas
    Kloeckner, Oliver
    Ruenger, Gudula
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (10): : 14645 - 14665
  • [10] Exploiting task and data parallelism for advanced video coding on hybrid CPU plus GPU platforms
    Momcilovic, Svetislav
    Roma, Nuno
    Sousa, Leonel
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2016, 11 (03) : 571 - 587