Cooperative DVFS for energy-efficient HEVC decoding on embedded CPU-GPU architecture

被引:3
|
作者
Gong, Fan [1 ]
Ju, Lei [1 ]
Zhang, Deshan [2 ]
Zhao, Mengying [1 ]
Jia, Zhiping [1 ]
机构
[1] Shandong Univ, Sch Comp Sci & Technol, Jinan, Shandong, Peoples R China
[2] Inspur Co Ltd, Jinan, Shandong, Peoples R China
关键词
DVFS; HEVC; power management; heterogeneous computing;
D O I
10.1145/3061639.3062216
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The next generation video coding standard High Efficiency Video Coding (HEVC) provides better compression rate for high resolution videos, at the cost of substantially higher computational complexity. While some latest off-the-shelf consumer electronics support HEVC via ASIC solutions, software implementation of real-time HEVC remains an open challenge for resource-constraint embedded systems. In this work, we present an HEVC decoder design on a low-power embedded heterogeneous multiprocessor System-on-Chip (HMPSoC) with CPU and GPU. Our analysis shows that the massive parallel architecture of GPU leads to a relatively smooth fluctuation on the processing time between video frames. Moreover, the dynamic workload of each frame has a monotonic correlation with a particular coding parameter that can be obtained at decoding time. Based on these observations, we propose an application-specific userspace CPU-GPU DVFS scheme which effectively saves the energy consumption for HEVC decoding. Furthermore, given our accurate workload prediction, only a small frame buffer is required to ensure real-time video decoding.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Energy-Efficient Run-Time Mapping and Thread Partitioning of Concurrent OpenCL Applications on CPU-GPU MPSoCs
    Singh, Amit Kumar
    Prakash, Alok
    Basireddy, Karunakar Reddy
    Merrett, Geoff V.
    Al-Hashimi, Bashir M.
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2017, 16
  • [22] Highly parallel HEVC decoding for heterogeneous systems with CPU and GPU
    Wang, Biao
    de Souza, Diego Felix
    Alvarez-Mesa, Mauricio
    Chi, Chi Ching
    Juurlink, Ben
    Ilic, Aleksandar
    Roma, Nuno
    Sousa, Leonel
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2018, 62 : 93 - 105
  • [23] Power Capping of CPU-GPU Heterogeneous Systems through Coordinating DVFS and Task Mapping
    Komoda, Toshiya
    Hayashi, Shingo
    Nakada, Takashi
    Miwa, Shinobu
    Nakamura, Hiroshi
    2013 IEEE 31ST INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD), 2013, : 349 - 356
  • [24] A 2-Layer Component-Based Architecture for Heterogeneous CPU-GPU Embedded Systems
    Campeanu, Gabriel
    Saadatmand, Mehrdad
    INFORMATION TECHNOLOGY: NEW GENERATIONS, 2016, 448 : 629 - 639
  • [25] A Step towards Energy Efficient Computing: Redesigning A Hydrodynamic Application on CPU-GPU
    Dong, Tingxing
    Dobrev, Veselin
    Kolev, Tzanio
    Rieben, Robert
    Tomov, Stanimire
    Dongarra, Jack
    2014 IEEE 28TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, 2014,
  • [26] Embedded CPU-GPU pupil tracking for eye motion stabilization
    Kowalski, Bartlomiej
    Martz, Eric P.
    Dubra, Alfredo
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2024, 65 (07)
  • [27] Component Allocation Optimization for Heterogeneous CPU-GPU Embedded Systems
    Campeanu, Gabriel
    Carlson, Jan
    Sentilles, Severine
    2014 40TH EUROMICRO CONFERENCE SERIES ON SOFTWARE ENGINEERING AND ADVANCED APPLICATIONS (SEAA 2014), 2014, : 229 - 236
  • [28] EFFICIENT PARALLEL PROCESSING BY IMPROVED CPU-GPU INTERACTION
    Khatter, Harsh
    Aggarwal, Vaishali
    PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON ISSUES AND CHALLENGES IN INTELLIGENT COMPUTING TECHNIQUES (ICICT), 2014, : 159 - 161
  • [29] Efficient Matrix Factorization on Heterogeneous CPU-GPU Systems
    Yu, Yuanhang
    Wen, Dong
    Zhang, Ying
    Wang, Xiaoyang
    Zhang, Wenjie
    Lin, Xuemin
    2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 1871 - 1876
  • [30] Efficient Pattern Matching on CPU-GPU Heterogeneous Systems
    Sanz, Victoria
    Pousa, Adrian
    Naiouf, Marcelo
    De Giusti, Armando
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING (ICA3PP 2019), PT I, 2020, 11944 : 391 - 403