Feedback Control Optimization for Performance and Energy Efficiency on CPU-GPU Heterogeneous Systems

被引:2
|
作者
Lin, Feng-Sheng [1 ]
Liu, Po-Ting [2 ]
Li, Ming-Hua [1 ]
Hsiung, Pao-Ann [2 ]
机构
[1] Ind Technol Res Inst, Informat & Commun Labs, Hsinchu 31040, Taiwan
[2] Natl Chung Cheng Univ, Dept Comp Sci & Informat Technol, Chiayi 62102, Taiwan
来源
ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2016 | 2016年 / 10048卷
关键词
CPU; GPU; Heterogeneous system; Frequency scaling; Workload division; Performance; Energy efficiency; POWER;
D O I
10.1007/978-3-319-49583-5_29
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Owing to the rising awareness of environment protection, high performance is not the only aim in system design, energy efficiency has increasingly become an important goal. In accordance with this goal, heterogeneous systems which are more efficient than CPU-based homogeneous systems, and occupying a growing proportion in the Top500 and the Green500 lists. Nevertheless, heterogeneous system design being more complex presents greater challenges in achieving a good tradeoff between performance and energy efficiency for applications running on such systems. To address the performance energy tradeoff issue in CPU-GPU heterogeneous systems, we propose a novel feedback control optimization (FCO) method that alternates between frequency scaling of device and division of kernel workload between CPU and GPU. Given a kernel and a workload division, frequency scaling involves finding near-optimal core frequency of the CPU and of the GPU. Further, an iterative algorithm is proposed for finding a near-optimal workload division that balance workload between CPU and GPU at a frequency that was optimal for the previous workload division. The frequency scaling phase and workload division phase are alternatively performed until the proposed FCO method converges and finds a configuration including core frequency for CPU, core frequency for GPU, and the workload division. Experiments show that compared with the state-of-the-art GreenGPU method, performance can be improved by 7.9%, while energy consumption can be reduced by 4.16%.
引用
收藏
页码:388 / 404
页数:17
相关论文
共 50 条
  • [21] Optimizing sparse matrix partitioning in a heterogeneous CPU-GPU system for high-performance
    Ahmad Shokrani Baigi
    Abdorreza Savadi
    Mahmoud Naghibzadeh
    Computing, 2025, 107 (4)
  • [22] Performance Analysis of Big Data ETL Process over CPU-GPU Heterogeneous Architectures
    Lee, Suyeon
    Park, Sungyong
    2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW 2021), 2021, : 42 - 47
  • [23] Hybrid-Smash: A Heterogeneous CPU-GPU Compression Library
    Penaranda, Cristian
    Reano, Carlos
    Silla, Federico
    IEEE ACCESS, 2024, 12 : 32706 - 32723
  • [24] Heterogeneous Computing (CPU-GPU) for Pollution Dispersion in an Urban Environment
    Fernandez, Gonzalo
    Mendina, Mariana
    Usera, Gabriel
    COMPUTATION, 2020, 8 (01)
  • [25] Application of CPU-GPU heterogeneous system in optical remote sensing image processing
    Dang Y.
    Wang X.
    Hongwai yu Jiguang Gongcheng/Infrared and Laser Engineering, 2020, 49
  • [26] A novel heterogeneous algorithm to simulate multiphase flow in porous media on multicore CPU-GPU systems
    McClure, J. E.
    Prins, J. F.
    Miller, C. T.
    COMPUTER PHYSICS COMMUNICATIONS, 2014, 185 (07) : 1865 - 1874
  • [27] A Heterogeneous CPU-GPU Implementation for Discrete Elements Simulation with Multiple GPUs
    Tian, Yuan
    Qi, Ji
    Lai, Junjie
    Zhou, Qingguo
    Yang, Lei
    2013 INTERNATIONAL JOINT CONFERENCE ON AWARENESS SCIENCE AND TECHNOLOGY & UBI-MEDIA COMPUTING (ICAST-UMEDIA), 2013, : 547 - +
  • [28] A Step towards Energy Efficient Computing: Redesigning A Hydrodynamic Application on CPU-GPU
    Dong, Tingxing
    Dobrev, Veselin
    Kolev, Tzanio
    Rieben, Robert
    Tomov, Stanimire
    Dongarra, Jack
    2014 IEEE 28TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, 2014,
  • [29] Performance comparison of CPU and GPU on a discrete heterogeneous architecture
    Thomas, Winnie
    Daruwala, Rohin D.
    2014 INTERNATIONAL CONFERENCE ON CIRCUITS, SYSTEMS, COMMUNICATION AND INFORMATION TECHNOLOGY APPLICATIONS (CSCITA), 2014, : 271 - 276
  • [30] A Simple Cache Coherence Scheme for Integrated CPU-GPU Systems
    Yudha, Ardhi Wiratama Baskara
    Pulungan, Reza
    Hoffmann, Henry
    Solihin, Yan
    PROCEEDINGS OF THE 2020 57TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2020,