CNN Workloads Characterization and Integrated CPU-GPU DVFS Governors on Embedded Systems

Cited by: 1
Authors
Karzhaubayeva, Meruyert [1]
Amangeldi, Aidar [1]
Park, Jurn-Gyu [1]
Affiliations
[1] Nazarbayev Univ, Sch Engn & Digital Sci, Astana 010000, Kazakhstan
Keywords
Convolutional neural networks (CNNs); dynamic power management (DPM); embedded systems; MANAGEMENT;
DOI
10.1109/LES.2023.3299335
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Dynamic power management (DPM) techniques on mobile systems are indispensable for deep learning (DL) inference optimization, which is mainly performed on battery-based mobile and/or embedded platforms with constrained resources. To this end, we characterize CNN workloads using object detection applications of YOLOv4/-tiny and YOLOv3/-tiny, and then propose integrated CPU-GPU DVFS governor policies that scale integrated pairs of CPU and GPU frequencies to improve energy-delay product (EDP) with negligible inference execution time degradation. Our results show up to 16.7% EDP improvements with negligible (mostly less than 2%) performance degradation using object detection applications on NVIDIA Jetson TX2.
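The energy-delay product mentioned in the abstract is conventionally the energy consumed multiplied by the execution time, so a lower value is better. The sketch below illustrates how a CPU-GPU frequency pair could be chosen by EDP under a bound on slowdown. It is not the authors' governor: the `Measurement` type, the `pick_pair` and `edp_improvement` helpers, the 2% slowdown bound as a hard constraint, and all frequency, energy, and delay values are hypothetical and are not real Jetson TX2 operating points or results.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Measurement:
    """One profiled run at a fixed (CPU, GPU) frequency pair. Hypothetical type."""
    cpu_mhz: int
    gpu_mhz: int
    energy_j: float  # energy consumed by the inference run, in joules
    delay_s: float   # inference execution time, in seconds

    @property
    def edp(self) -> float:
        # Energy-delay product: energy multiplied by execution time.
        return self.energy_j * self.delay_s


def pick_pair(measurements, baseline, max_slowdown=0.02):
    """Return the lowest-EDP measurement whose delay stays within the bound.

    The baseline itself is always a candidate, so a result always exists.
    """
    limit = baseline.delay_s * (1.0 + max_slowdown)
    candidates = [baseline] + [m for m in measurements if m.delay_s <= limit]
    return min(candidates, key=lambda m: m.edp)


def edp_improvement(baseline, chosen):
    """Fractional EDP reduction relative to the baseline (0.1 means 10%)."""
    return 1.0 - chosen.edp / baseline.edp


# Made-up numbers for illustration only.
baseline = Measurement(cpu_mhz=2000, gpu_mhz=1300, energy_j=10.0, delay_s=1.00)
profiled = [
    Measurement(cpu_mhz=1500, gpu_mhz=1300, energy_j=8.5, delay_s=1.01),
    Measurement(cpu_mhz=1000, gpu_mhz=1000, energy_j=6.0, delay_s=1.30),
]

chosen = pick_pair(profiled, baseline)
print(chosen.cpu_mhz, chosen.gpu_mhz, f"{edp_improvement(baseline, chosen):.1%}")
```

In this made-up data the second profiled pair has the lowest EDP overall, but it is rejected because it slows inference by 30%, which shows why the slowdown bound matters when EDP alone is the objective.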
Pages: 202-205
Page count: 4