Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications

Cited by: 51
Authors
Cai, Han [1]
Lin, Ji [1]
Lin, Yujun [1]
Liu, Zhijian [1]
Tang, Haotian [1]
Wang, Hanrui [1]
Zhu, Ligeng [1]
Han, Song [1]
Affiliations
[1] MIT, 77 Massachusetts Ave, Cambridge, MA 02139 USA
Keywords
Efficient deep learning; TinyML; model compression; AutoML; neural architecture search; neural-network accelerator; architecture; implementation; coprocessor; prediction; model
DOI
10.1145/3486618
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Deep neural networks (DNNs) have achieved unprecedented success in the field of artificial intelligence (AI), including computer vision, natural language processing, and speech recognition. However, their superior performance comes at the considerable cost of computational complexity, which greatly hinders their application on many resource-constrained devices, such as mobile phones and Internet of Things (IoT) devices. Therefore, methods and techniques that can lift the efficiency bottleneck while preserving the high accuracy of DNNs are in great demand to enable numerous edge AI applications. This article provides an overview of efficient deep learning methods, systems, and applications. We start by introducing popular model compression methods, including pruning, factorization, quantization, as well as compact model design. To reduce the large design cost of these manual solutions, we discuss the AutoML framework for each of them, such as neural architecture search (NAS) and automated pruning and quantization. We then cover efficient on-device training to enable user customization based on the local data on mobile devices. Apart from general acceleration techniques, we also showcase several task-specific accelerations for point cloud, video, and natural language processing by exploiting their spatial sparsity and temporal/token redundancy. Finally, to support all these algorithmic advancements, we introduce efficient deep learning system design from both software and hardware perspectives.
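As a rough illustration of two of the compression methods the abstract names (pruning and quantization), the following is a minimal pure-Python sketch. It is not taken from the article: the function names `magnitude_prune`, `quantize_uniform`, and `dequantize` are hypothetical, and the specific choices (magnitude-based pruning of a flat weight list, symmetric uniform quantization with a single scale) are assumptions made for illustration only.

```python
def magnitude_prune(weights, sparsity):
    """Zero out the smallest-magnitude weights (illustrative, not the article's method)."""
    k = int(len(weights) * sparsity)  # number of weights to set to zero
    if k == 0:
        return list(weights)
    # Sort indices by absolute value; the first k indices are pruned.
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    drop = set(order[:k])
    return [0.0 if i in drop else w for i, w in enumerate(weights)]


def quantize_uniform(weights, bits=8):
    """Symmetric uniform quantization; returns (integer codes, scale)."""
    qmax = 2 ** (bits - 1) - 1  # e.g. 127 for 8 bits
    max_abs = max((abs(w) for w in weights), default=0.0)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    # Round to the nearest integer level and clamp to [-qmax, qmax].
    codes = [max(-qmax, min(qmax, round(w / scale))) for w in weights]
    return codes, scale


def dequantize(codes, scale):
    """Map integer codes back to approximate floating-point weights."""
    return [c * scale for c in codes]


if __name__ == "__main__":
    w = [0.05, -0.8, 0.3, -0.02, 0.6, 0.1]  # made-up example weights
    pruned = magnitude_prune(w, sparsity=0.5)
    codes, scale = quantize_uniform(pruned, bits=8)
    print(pruned)
    print(codes)
    print(dequantize(codes, scale))
```

In this sketch, pruning reduces the number of nonzero weights and quantization reduces the bits stored per weight; the rounding error of each dequantized weight is at most half of `scale`.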
Pages: 50
Related papers
50 in total
  • [1] Deep learning methods and applications for electrical power systems: A comprehensive review
    Ozcanli, Asiye K.
    Yaprakdal, Fatma
    Baysal, Mustafa
    INTERNATIONAL JOURNAL OF ENERGY RESEARCH, 2020, 44 (09) : 7136 - 7157
  • [2] Machine learning and deep learning in project analytics: methods, applications and research trends
    Uddin, Shahadat
    Yan, Sirui
    Lu, Haohui
    PRODUCTION PLANNING & CONTROL, 2024, : 873 - 892
  • [3] Current trends on the use of deep learning methods for image analysis in energy applications
    Casini, Mattia
    De Angelis, Paolo
    Chiavazzo, Eliodoro
    Bergamasco, Luca
    ENERGY AND AI, 2024, 15
  • [4] Deep learning for drug repurposing: Methods, databases, and applications
    Pan, Xiaoqin
    Lin, Xuan
    Cao, Dongsheng
    Zeng, Xiangxiang
    Yu, Philip S.
    He, Lifang
    Nussinov, Ruth
    Cheng, Feixiong
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE, 2022, 12 (04)
  • [5] Methods and Applications of Mobile Molecular Communication
    Nakano, Tadashi
    Okaie, Yutaka
    Kobayashi, Shouhei
    Hara, Takahiro
    Hiraoka, Yasushi
    Haraguchi, Tokuko
    PROCEEDINGS OF THE IEEE, 2019, 107 (07) : 1442 - 1456
  • [6] A Survey on Theories and Applications for Self-Driving Cars Based on Deep Learning Methods
    Ni, Jianjun
    Chen, Yinan
    Chen, Yan
    Zhu, Jinxiu
    Ali, Deena
    Cao, Weidong
    APPLIED SCIENCES-BASEL, 2020, 10 (08):
  • [7] A survey on deep learning based reenactment methods for deepfake applications
    Dhanyalakshmi, Ramamurthy
    Popirlan, Claudiu-Ionut
    Hemanth, Duraisamy Jude
    IET IMAGE PROCESSING, 2024, : 4433 - 4460
  • [8] Comprehensive Review of Deep Reinforcement Learning Methods and Applications in Economics
    Mosavi, Amirhosein
    Faghan, Yaser
    Ghamisi, Pedram
    Duan, Puhong
    Ardabili, Sina Faizollahzadeh
    Salwana, Ely
    Band, Shahab S.
    MATHEMATICS, 2020, 8 (10)
  • [9] An Overview of Machine Learning within Embedded and Mobile Devices-Optimizations and Applications
    Ajani, Taiwo Samuel
    Imoize, Agbotiname Lucky
    Atayero, Aderemi A.
    SENSORS, 2021, 21 (13)
  • [10] Deep learning applications in protein crystallography
    Matinyan, Senik
    Filipcik, Pavel
    Abrahams, Jan Pieter
    ACTA CRYSTALLOGRAPHICA A-FOUNDATION AND ADVANCES, 2024, 80 : 1 - 17