CPU-GPU Tuning for Modern Scientific Applications using Node-Level Heterogeneity

被引:0
|
作者
Thavappiragasam, Mathialakan [1 ]
Kale, Vivek [2 ]
机构
[1] Argonne Natl Lab, Lemont, IL 60439 USA
[2] Sandia Natl Labs, Livermore, CA USA
来源
2023 IEEE 30TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS, HIPC 2023 | 2023年
关键词
inter-device concurrency; performance tuning; CUDA; OpenMP; supercomputer; GPU; CPU; workflows; AI/ML;
D O I
10.1109/HiPC58850.2023.00034
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Scientific applications must be tuned for performance to run efficiently on supercomputers having nodes with a CPU (or, a general-purpose host processor) and GPUs (or, accelerator device processors). Conventional wisdom suggests focusing tuning of applications for a GPU and making the CPU only have the role of offloading computation to the GPU, given the CPU's relatively miniscule amount of computational power. However, this is overly conservative for modern scientific applications, which include those using scientific workflows with real-time data constraints and AI/ML with low numerical precision requirements. This work identifies new performance opportunities for modern scientific applications via CPU-GPU tuning, a strategy that unifies and integrates tuning of the CPU and GPU performance parameters. Applying CPU-GPU tuning to a dot product representative of these applications run on the widely-used Summit supercomputer results in up to an 8.15x speedup. These results provide groundwork for auto-tuning software for applications run on supercomputers having node-level heterogeneity.
引用
收藏
页码:179 / 183
页数:5
相关论文
共 31 条
  • [1] Heterogeneous programming using OpenMP and CUDA/HIP for hybrid CPU-GPU scientific applications
    Tallada, Marc Gonzalez
    Morancho, Enric
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2023, 37 (05) : 626 - 646
  • [2] Boosting CUDA Applications with CPU-GPU Hybrid Computing
    Lee, Changmin
    Ro, Won Woo
    Gaudiot, Jean-Luc
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2014, 42 (02) : 384 - 404
  • [3] BigKernel - High Performance CPU-GPU Communication Pipelining for Big Data-style Applications
    Mokhtari, Reza
    Stumm, Michael
    2014 IEEE 28TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, 2014,
  • [4] Performance Improvement of CUDA Applications by Reducing CPU-GPU Data Transfer Overhead
    Sunitha, N., V
    Raju, K.
    Chiplunkar, Niranjan N.
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON INVENTIVE COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICICCT), 2017, : 211 - 215
  • [5] Speeding up Planning in Multiagent Settings Using CPU-GPU Architectures
    Adoe, Fadel
    Chen, Yingke
    Doshi, Prashant
    AGENTS AND ARTIFICIAL INTELLIGENCE, ICAART 2015, 2015, 9494 : 262 - 283
  • [6] Training Convolutional Neural Networks with Differential Evolution using Concurrent Task Apportioning on Hybrid CPU-GPU Architectures
    Venkat, Rochan Avlur
    Oussalem, Zakaria
    Bhattacharya, Arya Kumar
    2021 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC 2021), 2021, : 2567 - 2576
  • [7] Co-Scheduling on Fused CPU-GPU Architectures With Shared Last Level Caches
    Damschen, Marvin
    Mueller, Frank
    Henkel, Joerg
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2018, 37 (11) : 2337 - 2347
  • [8] Improved Parallel Implementation of 1D Discrete Wavelet Transform Using CPU-GPU
    Rodriguez-Martinez, Eduardo
    Benavides-Alvarez, Cesar
    Aviles-Cruz, Carlos
    Lopez-Saca, Fidel
    Ferreyra-Ramirez, Andres
    ELECTRONICS, 2023, 12 (16)
  • [9] Implementation of Cubic Spline Interpolation on Parallel Skeleton using Pipeline Model on CPU-GPU Cluster
    Mohanty, Prasant Kumar
    Reza, Motahar
    Kumar, Piyush
    Kumar, Praveen
    2016 IEEE 6TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (IACC), 2016, : 747 - 751
  • [10] Simultaneous parallel power flow calculations using hybrid CPU-GPU approach
    Araujo, Igor
    Tadaiesky, Vincent
    Cardoso, Diego
    Fukuyama, Yoshikazu
    Santana, Adamo
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2019, 105 : 229 - 236