A Lightweight Collaborative Deep Neural Network for the Mobile Web in Edge Cloud

被引:30
作者
Huang, Yakun [1 ]
Qiao, Xiuquan [1 ]
Ren, Pei [1 ]
Liu, Ling [2 ]
Pu, Calton [2 ]
Dustdar, Schahram [3 ]
Chen, Junliang [1 ]
机构
[1] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China
[2] Georgia Inst Technol, Sch Comp Sci, Atlanta, GA 30332 USA
[3] Tech Univ Wien, Distributed Syst Grp, A-1040 Vienna, Austria
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Cloud computing; Servers; Edge computing; Throughput; Task analysis; Computational modeling; Performance evaluation; Collaborative DNNs; mobile web; binary neural network; dynamic allcoation; edge computing; AR;
D O I
10.1109/TMC.2020.3043051
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Enabling deep learning technology on the mobile web can improve the user's experience for achieving web artificial intelligence in various fields. However, heavy DNN models and limited computing resources of the mobile web are now unable to support executing computationally intensive DNNs when deploying in a cloud computing platform. With the help of promising edge computing, we propose a lightweight collaborative deep neural network for the mobile web, named LcDNN, which contributes to three aspects: (1) We design a composite collaborative DNN that reduces the model size, accelerates inference, and reduces mobile energy cost by executing a lightweight binary neural network (BNN) branch on the mobile web. (2) We provide a jointly training method for LcDNN and implement an energy-efficient inference library for executing the BNN branch on the mobile web. (3) To further promote the resource utilization of the edge cloud, we develop a DRL-based online scheduling scheme to obtain an optimal allocation for LcDNN. The experimental results show that LcDNN outperforms existing approaches for reducing the model size by about 16x to 29x. It also reduces the end-to-end latency and mobile energy cost with acceptable accuracy and improves the throughput and resource utilization of the edge cloud.
引用
收藏
页码:2289 / 2305
页数:17
相关论文
共 50 条
  • [1] Alex K., 2009, LEARNING MULTIPLE LA
  • [2] Anil R., 2018, arXiv preprint arXiv:1804.03235, P1
  • [3] [Anonymous], 2018, W3C STRATEGIC HIGHLI
  • [4] [Anonymous], 2018, MACHINE LEARNING WEB
  • [5] [Anonymous], 2017, WONDER SHAPER
  • [6] [Anonymous], 2019, P 2 SYSML C
  • [7] Bandanau D, 2016, INT CONF ACOUST SPEE, P4945, DOI 10.1109/ICASSP.2016.7472618
  • [8] Caffe.js framework, 2017, DEEP LEARN BROWS
  • [9] Courbariaux M, 2015, ADV NEUR IN, V28
  • [10] JointDNN: An Efficient Training and Inference Engine for Intelligent Mobile Cloud Computing Services
    Eshratifar, Amir Erfan
    Abrishami, Mohammad Saeed
    Pedram, Massoud
    [J]. IEEE TRANSACTIONS ON MOBILE COMPUTING, 2021, 20 (02) : 565 - 576