Feedback Control Optimization for Performance and Energy Efficiency on CPU-GPU Heterogeneous Systems

被引:2
作者
Lin, Feng-Sheng [1 ]
Liu, Po-Ting [2 ]
Li, Ming-Hua [1 ]
Hsiung, Pao-Ann [2 ]
机构
[1] Ind Technol Res Inst, Informat & Commun Labs, Hsinchu 31040, Taiwan
[2] Natl Chung Cheng Univ, Dept Comp Sci & Informat Technol, Chiayi 62102, Taiwan
来源
ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2016 | 2016年 / 10048卷
关键词
CPU; GPU; Heterogeneous system; Frequency scaling; Workload division; Performance; Energy efficiency; POWER;
D O I
10.1007/978-3-319-49583-5_29
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Owing to the rising awareness of environment protection, high performance is not the only aim in system design, energy efficiency has increasingly become an important goal. In accordance with this goal, heterogeneous systems which are more efficient than CPU-based homogeneous systems, and occupying a growing proportion in the Top500 and the Green500 lists. Nevertheless, heterogeneous system design being more complex presents greater challenges in achieving a good tradeoff between performance and energy efficiency for applications running on such systems. To address the performance energy tradeoff issue in CPU-GPU heterogeneous systems, we propose a novel feedback control optimization (FCO) method that alternates between frequency scaling of device and division of kernel workload between CPU and GPU. Given a kernel and a workload division, frequency scaling involves finding near-optimal core frequency of the CPU and of the GPU. Further, an iterative algorithm is proposed for finding a near-optimal workload division that balance workload between CPU and GPU at a frequency that was optimal for the previous workload division. The frequency scaling phase and workload division phase are alternatively performed until the proposed FCO method converges and finds a configuration including core frequency for CPU, core frequency for GPU, and the workload division. Experiments show that compared with the state-of-the-art GreenGPU method, performance can be improved by 7.9%, while energy consumption can be reduced by 4.16%.
引用
收藏
页码:388 / 404
页数:17
相关论文
共 50 条
  • [31] PARALLEL SOLVER FOR SHIFTED SYSTEMS IN A HYBRID CPU-GPU FRAMEWORK
    Bosnery, Nela
    Bujanovic, Zvonimir
    Drmac, Zlatko
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2018, 40 (04) : C605 - C633
  • [32] GSched: An efficient scheduler for hybrid CPU-GPU HPC systems
    Mateos, Mariano Raboso
    Robles, Juan Antonio Cotobal
    1600, Springer Verlag (217): : 179 - 185
  • [33] A distributed in-memory key-value store system on heterogeneous CPU-GPU cluster
    Zhang, Kai
    Wang, Kaibo
    Yuan, Yuan
    Guo, Lei
    Li, Rubao
    Zhang, Xiaodong
    He, Bingsheng
    Hu, Jiayu
    Hua, Bei
    VLDB JOURNAL, 2017, 26 (05) : 729 - 750
  • [34] A Scalable and Portable Approach to Accelerate Hybrid HPL on Heterogeneous CPU-GPU Clusters
    Shi, Rong
    Potluri, Sreeram
    Hamidouche, Khald
    Lu, Xiaoyi
    Tomko, Karen
    Panda, Dhabaleswar K.
    2013 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2013,
  • [35] A Black-Box Approach to Energy-Aware Scheduling on Integrated CPU-GPU Systems
    Barik, Rajkishore
    Farooqui, Naila
    Lewis, Brian T.
    Hu, Chunling
    Shpeisman, Tatiana
    PROCEEDINGS OF CGO 2016: THE 14TH INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION, 2016, : 70 - 81
  • [36] A collaborative CPU-GPU approach for principal component analysis on mobile heterogeneous platforms
    Valery, Olivier
    Liu, Pangfeng
    Wu, Jan-Jan
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2018, 120 : 44 - 61
  • [37] An approach to optimise the energy efficiency of iterative computation on integrated GPU-CPU systems
    Garzon, E. M.
    Moreno, J. J.
    Martinez, J. A.
    JOURNAL OF SUPERCOMPUTING, 2017, 73 (01) : 114 - 125
  • [38] Using Criticality of GPU Accesses in Memory Management for CPU-GPU Heterogeneous Multi-Core Processors
    Rai, Siddharth
    Chaudhuri, Mainak
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2017, 16
  • [39] Cholesky Factorization on Heterogeneous CPU and GPU Systems
    Chen, Jieyang
    Chen, Zizhong
    2015 NINTH INTERNATIONAL CONFERENCE ON FRONTIER OF COMPUTER SCIENCE AND TECHNOLOGY FCST 2015, 2015, : 19 - 26
  • [40] User Interaction Aware Reinforcement Learning for Power and Thermal Efficiency of CPU-GPU Mobile MPSoCs
    Dey, Somdip
    Singh, Amit Kumar
    Wang, Xiaohang
    McDonald-Maier, Klaus
    PROCEEDINGS OF THE 2020 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2020), 2020, : 1728 - 1733