HPSM: a programming framework to exploit multi-CPU and multi-GPU systems simultaneously

被引：0

作者：

Ferreira Lima, Joao Vicente ^{[1
]}

Di Domenico, Daniel ^{[1
]}

机构：

[1] Univ Fed Santa Maria, Santa Maria, RS, Brazil

来源：

INTERNATIONAL JOURNAL OF GRID AND UTILITY COMPUTING | 2019年 / 10卷 / 03期

关键词：

high performance computing; CPU-GPU systems; parallel programming models; high-level framework; parallel loops;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents a high-level C++ framework to explore multi-CPU and multi-GPU systems called HPSM. HPSM enables execution of parallel loops and reductions simultaneously over CPUs and GPUs using three parallel backends: Serial, OpenMP, and StarPU. We analysed HPSM development effort with AXPY program through two standard metrics (NCLOC and ES). In addition, we evaluated performance and energy with three parallel benchmarks: N-Body, Hotspot, and CFD solver. HPSM reduced code effort by up to 56.9% compared to StarPU C interface, although it resulted in 2.5x more lines of code compared to OpenMP. The CPU-GPU combination attained speedup results with Hotspot of up to 92.7x on a X86-based system with four GPUs and up to 108.2x on an IBM POWER8+ system with two GPUs. On both systems, the addition of GPUs improved energy efficiency.

引用

页码：201 / 211

页数：11

共 32 条

[1] An adaptive methodology for multi-GPU programming in OpenCL
Cavalcanti Bueno, Andre Luis
Rodriguez, Noemi de La Rocque
Sotelino, Elisa Dominguez
ENGINEERING COMPUTATIONS, 2017, 34 (04) : 1277 - 1292
[2] CASE: A Compiler-Assisted SchEduling Framework for Multi-GPU Systems
Chen, Chao
Porter, Chris
Pande, Santosh
PPOPP'22: PROCEEDINGS OF THE 27TH ACM SIGPLAN SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING, 2022, : 17 - 31
[3] PSkel: A stencil programming framework for CPU-GPU systems
Pereira, Alyson D.
Ramos, Luiz
Goes, Luis F. W.
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2015, 27 (17): : 4938 - 4953
[4] Consumer Level Multi-GPU Systems Utilization, Efficiency, and Optimization
Ross, John Brandon
2013 PROCEEDINGS OF IEEE SOUTHEASTCON, 2013,
[5] Multi-GPU Implementation of k-Nearest Neighbor Algorithm
Masek, Jan
Burget, Kadim
Karasek, Jan
Uher, Vaclav
Dutta, Malay Kishore
2015 38TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2015, : 764 - 767
[6] OpenACC Multi-GPU Approach for WSM6 Microphysics
da Silva, Hercules Cardoso
Stefanes, Marco A.
Capistrano, Vinicius
2021 IEEE 28TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS (HIPC 2021), 2021, : 382 - 387
[7] Accelerated CFD computations on multi-GPU using OpenMP and OpenACC
Harshad Bhusare
Nandan Sarkar
Debajyoti Kumar
Somnath Roy
Sādhanā, 49
[8] Accelerated CFD computations on multi-GPU using OpenMP and OpenACC
Bhusare, Harshad
Sarkar, Nandan
Kumar, Debajyoti
Roy, Somnath
SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2024, 49 (01):
[9] A multi-GPU based high-performance computing framework in elastodynamics simulation using octree meshes
Mohammadian, Shayan
Kumar, Ankit S.
Song, Chongmin
COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2025, 436
[10] SeisSol on Distributed Multi-GPU Systems: CUDA Code Generation for the Modal Discontinuous Galerkin Method
Dorozhinskii, Ravil
Bader, Michael
PROCEEDINGS OF INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING IN ASIA-PACIFIC REGION (HPC ASIA 2021), 2020, : 69 - 82

← 1 2 3 4 →