HPSM: a programming framework to exploit multi-CPU and multi-GPU systems simultaneously

被引:0
|
作者
Ferreira Lima, Joao Vicente [1 ]
Di Domenico, Daniel [1 ]
机构
[1] Univ Fed Santa Maria, Santa Maria, RS, Brazil
关键词
high performance computing; CPU-GPU systems; parallel programming models; high-level framework; parallel loops;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a high-level C++ framework to explore multi-CPU and multi-GPU systems called HPSM. HPSM enables execution of parallel loops and reductions simultaneously over CPUs and GPUs using three parallel backends: Serial, OpenMP, and StarPU. We analysed HPSM development effort with AXPY program through two standard metrics (NCLOC and ES). In addition, we evaluated performance and energy with three parallel benchmarks: N-Body, Hotspot, and CFD solver. HPSM reduced code effort by up to 56.9% compared to StarPU C interface, although it resulted in 2.5x more lines of code compared to OpenMP. The CPU-GPU combination attained speedup results with Hotspot of up to 92.7x on a X86-based system with four GPUs and up to 108.2x on an IBM POWER8+ system with two GPUs. On both systems, the addition of GPUs improved energy efficiency.
引用
收藏
页码:201 / 211
页数:11
相关论文
共 32 条
  • [1] An adaptive methodology for multi-GPU programming in OpenCL
    Cavalcanti Bueno, Andre Luis
    Rodriguez, Noemi de La Rocque
    Sotelino, Elisa Dominguez
    ENGINEERING COMPUTATIONS, 2017, 34 (04) : 1277 - 1292
  • [2] CASE: A Compiler-Assisted SchEduling Framework for Multi-GPU Systems
    Chen, Chao
    Porter, Chris
    Pande, Santosh
    PPOPP'22: PROCEEDINGS OF THE 27TH ACM SIGPLAN SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING, 2022, : 17 - 31
  • [3] PSkel: A stencil programming framework for CPU-GPU systems
    Pereira, Alyson D.
    Ramos, Luiz
    Goes, Luis F. W.
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2015, 27 (17): : 4938 - 4953
  • [4] Consumer Level Multi-GPU Systems Utilization, Efficiency, and Optimization
    Ross, John Brandon
    2013 PROCEEDINGS OF IEEE SOUTHEASTCON, 2013,
  • [5] Multi-GPU Implementation of k-Nearest Neighbor Algorithm
    Masek, Jan
    Burget, Kadim
    Karasek, Jan
    Uher, Vaclav
    Dutta, Malay Kishore
    2015 38TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2015, : 764 - 767
  • [6] OpenACC Multi-GPU Approach for WSM6 Microphysics
    da Silva, Hercules Cardoso
    Stefanes, Marco A.
    Capistrano, Vinicius
    2021 IEEE 28TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS (HIPC 2021), 2021, : 382 - 387
  • [7] Accelerated CFD computations on multi-GPU using OpenMP and OpenACC
    Harshad Bhusare
    Nandan Sarkar
    Debajyoti Kumar
    Somnath Roy
    Sādhanā, 49
  • [8] Accelerated CFD computations on multi-GPU using OpenMP and OpenACC
    Bhusare, Harshad
    Sarkar, Nandan
    Kumar, Debajyoti
    Roy, Somnath
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2024, 49 (01):
  • [9] A multi-GPU based high-performance computing framework in elastodynamics simulation using octree meshes
    Mohammadian, Shayan
    Kumar, Ankit S.
    Song, Chongmin
    COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2025, 436
  • [10] SeisSol on Distributed Multi-GPU Systems: CUDA Code Generation for the Modal Discontinuous Galerkin Method
    Dorozhinskii, Ravil
    Bader, Michael
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING IN ASIA-PACIFIC REGION (HPC ASIA 2021), 2020, : 69 - 82