CNN Workloads Characterization and Integrated CPU-GPU DVFS Governors on Embedded Systems

被引:1
|
作者
Karzhaubayeva, Meruyert [1 ]
Amangeldi, Aidar [1 ]
Park, Jurn-Gyu [1 ]
机构
[1] Nazarbayev Univ, Sch Engn & Digital Sci, Astana 010000, Kazakhstan
关键词
Convolutional neural networks (CNNs); dynamic power management (DPM); embedded systems; MANAGEMENT;
D O I
10.1109/LES.2023.3299335
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Dynamic power management (DPM) techniques on mobile systems are indispensable for deep learning (DL) inference optimization, which is mainly performed on battery-based mobile and/or embedded platforms with constrained resources. To this end, we characterize CNN workloads using object detection applications of YOLOv4/-tiny and YOLOv3/-tiny, and then propose integrated CPU-GPU DVFS governor policies that scale integrated pairs of CPU and GPU frequencies to improve energy-delay product (EDP) with negligible inference execution time degradation. Our results show up to 16.7% EDP improvements with negligible (mostly less than 2%) performance degradation using object detection applications on NVIDIA Jetson TX2.
引用
收藏
页码:202 / 205
页数:4
相关论文
共 50 条
  • [41] PARALLEL SOLVER FOR SHIFTED SYSTEMS IN A HYBRID CPU-GPU FRAMEWORK
    Bosnery, Nela
    Bujanovic, Zvonimir
    Drmac, Zlatko
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2018, 40 (04): : C605 - C633
  • [42] Accelerating image convolution filtering algorithms on integrated CPU-GPU architectures
    Zhou, Yi
    He, Fazhi
    Qiu, Yimin
    JOURNAL OF ELECTRONIC IMAGING, 2018, 27 (03)
  • [43] Implementation and Analysis of GNSS Software Receiver on Embedded CPU-GPU Heterogeneous Architecture
    Park, Kwi Woo
    Jang, Woo Jin
    Park, Chansik
    Kim, Sunwoo
    Lee, Min Jun
    PROCEEDINGS OF THE 29TH INTERNATIONAL TECHNICAL MEETING OF THE SATELLITE DIVISION OF THE INSTITUTE OF NAVIGATION (ION GNSS+ 2016), 2016, : 70 - 76
  • [44] Understanding Idle Behavior and Power Gating Mechanisms in the Context of Modern Benchmarks on CPU-GPU Integrated Systems
    Arora, Manish
    Manne, Srilatha
    Paul, Indrani
    Jayasena, Nuwan
    Tullsen, Dean M.
    2015 IEEE 21ST INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA), 2015, : 366 - 377
  • [45] Noniterative Multireference Coupled Cluster Methods on Heterogeneous CPU-GPU Systems
    Bhaskaran-Nair, Kiran
    Ma, Wenjing
    Krishnamoorthy, Sriram
    Villa, Oreste
    van Dam, Hubertus J. J.
    Apra, Edoardo
    Kowalski, Karol
    JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2013, 9 (04) : 1949 - 1957
  • [46] Integrated CPU-GPU Power Management for 3D Mobile Games
    Pathania, Anuj
    Jiao, Qing
    Prakash, Alok
    Mitra, Tulika
    2014 51ST ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2014,
  • [47] Asynchronous Processing for Latent Fingerprint Identification on Heterogeneous CPU-GPU Systems
    Sanchez-Fernandez, Andres J.
    Romero, Luis F.
    Peralta, Daniel
    Medina-Perez, Miguel Angel
    Saeys, Yvan
    Herrera, Francisco
    Tabik, Siham
    IEEE ACCESS, 2020, 8 (08): : 124236 - 124253
  • [48] A hybrid computing method of SpMV on CPU-GPU heterogeneous computing systems
    Yang, Wangdong
    Li, Kenli
    Li, Keqin
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2017, 104 : 49 - 60
  • [49] A Runtime Workload Distribution with Resource Allocation for CPU-GPU Heterogeneous Systems
    Alsubaihi, Shouq
    Gaudiot, Jean-Luc
    2017 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2017, : 994 - 1003
  • [50] Mixed-Cell-Height Legalization on CPU-GPU Heterogeneous Systems
    Yang, Haoyu
    Fung, Kit
    Zhao, Yuxuan
    Lin, Yibo
    Yu, Bei
    PROCEEDINGS OF THE 2022 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2022), 2022, : 784 - 789