A collaborative CPU-GPU approach for deep learning on mobile devices

被引:8
|
作者
Valery, Olivier [1 ,2 ]
Liu, Pangfeng [1 ,3 ]
Wu, Jan-Jan [1 ]
机构
[1] Natl Taiwan Univ, Dept Comp Sci & Informat Engn, Taipei, Taiwan
[2] Acad Sinica, Res Ctr Informat Technol Innovat, Taipei, Taiwan
[3] Natl Taiwan Univ, Grad Inst Networking & Multimedia, Taipei, Taiwan
来源
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE | 2019年 / 31卷 / 17期
关键词
deep learning; energy efficient; GPGPU; heterogeneous system; mobile computing; OpenCL; OPENCL;
D O I
10.1002/cpe.5225
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
As mobile devices become more prevalent, users tend to reassess their expectations regarding the personalization of mobile services. The data collected by a mobile device's sensors provide an opportunity to gain insight into the user's profile. Recently, deep learning has gained momentum and has become the method of choice for solving machine learning problems. Interestingly, training a deep neural network on a mobile device is often mistakenly regarded as cumbersome. For instance, several deep learning frameworks only provide a CPU-based implementation for prediction tasks on a mobile device. In contrast to servers, a mobile computing environment imposes many domain-specific constraints that invite us to review the general computing approach used in a deep learning framework implementation. In this paper, we propose a deep learning framework that has been specifically designed for mobile device platforms. Our approach relies on the collaboration of the multicore CPU and the integrated GPU to accelerate deep learning computation on mobile devices. Our work exploits the shared memory architecture of mobile devices to promote CPU-GPU collaboration without any data copying. We analyze our approach with regard to three factors: performance/portability trade-off, power efficiency, and memory management.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] OPTiC: Optimizing Collaborative CPU-GPU Computing on Mobile Devices With Thermal Constraints
    Wang, Siqi
    Ananthanarayanan, Gayathri
    Mitra, Tulika
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2019, 38 (03) : 393 - 406
  • [2] A collaborative CPU-GPU approach for principal component analysis on mobile heterogeneous platforms
    Valery, Olivier
    Liu, Pangfeng
    Wu, Jan-Jan
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2018, 120 : 44 - 61
  • [3] An Adaptive On-line CPU-GPU Governor for Games on Mobile Devices
    Chuang, Po-Kai
    Chen, Ya-Shu
    Huang, Po-Hao
    2017 22ND ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2017, : 653 - 658
  • [4] Fault-tolerant deep learning inference on CPU-GPU integrated edge devices with TEEs
    Xu, Hongjian
    Liao, Longlong
    Liu, Xinqi
    Chen, Shuguang
    Chen, Jianguo
    Liang, Zhixuan
    Yu, Yuanlong
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 161 : 404 - 414
  • [5] A CPU-GPU Cooperative Sorting Approach
    Raju, K.
    Chiplunkar, Niranjan N.
    Rajanikanth, Kavoor
    2019 INNOVATIONS IN POWER AND ADVANCED COMPUTING TECHNOLOGIES (I-PACT), 2019,
  • [6] Accelerating Pattern Matching with CPU-GPU Collaborative Computing
    Sanz, Victoria
    Pousa, Adrian
    Naiouf, Marcelo
    De Giusti, Armando
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2018, PT I, 2018, 11334 : 310 - 322
  • [7] Machine Learning Based Predictive Models in Mobile Platforms Using CPU-GPU
    Sohankar, Javad
    Pore, Madhurima
    Banerjee, Ayan
    Sadeghi, Koosha
    Gupta, Sandeep K. S.
    2020 7TH INTERNATIONAL CONFERENCE ON INTERNET OF THINGS: SYSTEMS, MANAGEMENT AND SECURITY (IOTSMS), 2020,
  • [8] Demystifying the TensorFlow Eager Execution of Deep Learning Inference on a CPU-GPU Tandem
    Delestrac, Paul
    Torres, Lionel
    Novo, David
    2022 25TH EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN (DSD), 2022, : 446 - 455
  • [9] Deep learning based data prefetching in CPU-GPU unified virtual memory
    Long, Xinjian
    Gong, Xiangyang
    Zhang, Bo
    Zhou, Huiyang
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2023, 174 : 19 - 31
  • [10] Task Scheduling of Parallel Processing in CPU-GPU Collaborative Environment
    Wang, Lei
    Huang, Yong-zhong
    Chen, Xin
    Zhang, Chun-yan
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, 2008, : 228 - +