A Dynamic Deep Neural Network Design for Efficient Workload Allocation in Edge Computing

被引:34
|
作者
Lo, Chi [1 ]
Su, Yu-Yi [1 ]
Lee, Chun-Yi [1 ]
Chang, Shih-Chieh [1 ]
机构
[1] Natl Tsing Hua Univ, Dept Comp Sci, 101,Sec 2,Kuang Fu Rd, Hsinchu 30013, Taiwan
来源
2017 IEEE 35TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD) | 2017年
关键词
Deep neural network; workload allocation; edge computing; authentic operation; dynamic network structure;
D O I
10.1109/ICCD.2017.49
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Unreliable communication channels and limited computing resources at the edge end are two primary constraints of battery-powered movable devices, such as autonomous robots and unmanned aerial vehicles (UAVs). The impact is especially severe for those performing deep neural network (DNN) computations. With increasing demand for accuracy, the trend in modern DNN designs is the use of cascaded modularized layers. Implementing a deep network at the edge increases computational workloads and resource occupancy, leading to an increase in battery drain. Using a shallow network and offloading workloads to backbone servers, however, incur significant latency overheads caused by unstable communication channels. Hence, dynamic DNN design techniques for efficient workload allocation are urgently required to manage the amount of workload transmissions while achieving the required accuracy. In this paper, we explore the use of authentic operation (AO) unit and dynamic network structure to enhance DNNs. The AO unit defines a set of stochastic threshold values for different DNN output classes and determines at runtime if an input has to be transferred to backbone servers for further analysis. The dynamic network structure adjusts its depth according to channel availability. Experiments have been comprehensively performed on several well-known DNN models and datasets. Our results show that, on an average, the proposed techniques are able to reduce the amount of transmissions by up to 17% compared to previous methods under the same accuracy requirement.
引用
收藏
页码:273 / 280
页数:8
相关论文
共 50 条
  • [1] Dynamic Workload Allocation for Edge Computing
    Hung, Yi-Wen
    Chen, Yung-Chih
    Lo, Chi
    So, Austin Go
    Chang, Shih-Chieh
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2021, 29 (03) : 519 - 529
  • [2] Design and Implementation of Deep Neural Network for Edge Computing
    Zhang, Junyang
    Guo, Yang
    Hu, Xiao
    Li, Rongzhen
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (08) : 1982 - 1996
  • [3] High Speed and Energy Efficient Deep Neural Network for Edge Computing
    Bai, Kangjun
    Liu, Shiya
    Yi, Yang
    SEC'19: PROCEEDINGS OF THE 4TH ACM/IEEE SYMPOSIUM ON EDGE COMPUTING, 2019, : 347 - 349
  • [4] Workload Prediction in Edge Computing based on Graph Neural Network
    Miao, WeiWei
    Zeng, Zeng
    Zhang, Mingxuan
    Quan, Siping
    Zhang, Zhen
    Li, Shihao
    Zhang, Li
    Sun, Qi
    19TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2021), 2021, : 1663 - 1666
  • [5] Efficient Deep Neural Networks for Edge Computing
    Alnemari, Mohammed
    Bagherzadeh, Nader
    2019 IEEE INTERNATIONAL CONFERENCE ON EDGE COMPUTING (IEEE EDGE), 2019, : 1 - 7
  • [6] Application Aware Workload Allocation for Edge Computing-Based IoT
    Fan, Qiang
    Ansari, Nirwan
    IEEE INTERNET OF THINGS JOURNAL, 2018, 5 (03): : 2146 - 2153
  • [7] Energy-Efficient and Delay-Guaranteed Workload Allocation in IoT-Edge-Cloud Computing Systems
    Guo, Mian
    Li, Lei
    Guan, Quansheng
    IEEE ACCESS, 2019, 7 : 78685 - 78697
  • [8] (Poster) Blockchain-based Edge Computing for Deep Neural Network Applications
    Kim, Jae-Yun
    Moon, Soo-Mook
    WORKSHOP PROCEEDINGS 2018: INTELLIGENT EMBEDDED SYSTEMS ARCHITECTURES AND APPLICATIONS (INTESA), 2018, : 53 - 55
  • [9] Cooperative Distributed Deep Neural Network Deployment with Edge Computing
    Yang, Cian-You
    Kuo, Jian-Jhih
    Sheu, Jang-Ping
    Zheng, Ke-Jun
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [10] Energy-efficient Workload Allocation and Computation Resource Configuration in Distributed Cloud/Edge Computing Systems With Stochastic Workloads
    Zhang, Wenyu
    Zhang, Zhenjiang
    Zeadally, Sherali
    Chao, Han-Chieh
    Leung, Victor C. M.
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2020, 38 (06) : 1118 - 1132