A Dynamic Deep Neural Network Design for Efficient Workload Allocation in Edge Computing

被引:34
|
作者
Lo, Chi [1 ]
Su, Yu-Yi [1 ]
Lee, Chun-Yi [1 ]
Chang, Shih-Chieh [1 ]
机构
[1] Natl Tsing Hua Univ, Dept Comp Sci, 101,Sec 2,Kuang Fu Rd, Hsinchu 30013, Taiwan
来源
2017 IEEE 35TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD) | 2017年
关键词
Deep neural network; workload allocation; edge computing; authentic operation; dynamic network structure;
D O I
10.1109/ICCD.2017.49
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Unreliable communication channels and limited computing resources at the edge end are two primary constraints of battery-powered movable devices, such as autonomous robots and unmanned aerial vehicles (UAVs). The impact is especially severe for those performing deep neural network (DNN) computations. With increasing demand for accuracy, the trend in modern DNN designs is the use of cascaded modularized layers. Implementing a deep network at the edge increases computational workloads and resource occupancy, leading to an increase in battery drain. Using a shallow network and offloading workloads to backbone servers, however, incur significant latency overheads caused by unstable communication channels. Hence, dynamic DNN design techniques for efficient workload allocation are urgently required to manage the amount of workload transmissions while achieving the required accuracy. In this paper, we explore the use of authentic operation (AO) unit and dynamic network structure to enhance DNNs. The AO unit defines a set of stochastic threshold values for different DNN output classes and determines at runtime if an input has to be transferred to backbone servers for further analysis. The dynamic network structure adjusts its depth according to channel availability. Experiments have been comprehensively performed on several well-known DNN models and datasets. Our results show that, on an average, the proposed techniques are able to reduce the amount of transmissions by up to 17% compared to previous methods under the same accuracy requirement.
引用
收藏
页码:273 / 280
页数:8
相关论文
共 50 条
  • [31] An Energy-Efficient Method for Recurrent Neural Network Inference in Edge Cloud Computing
    Chen, Chao
    Guo, Weiyu
    Wang, Zheng
    Yang, Yongkui
    Wu, Zhuoyu
    Li, Guannan
    SYMMETRY-BASEL, 2022, 14 (12):
  • [32] An Effective Training Scheme for Deep Neural Network in Edge Computing Enabled Internet of Medical Things (IoMT) Systems
    Pustokhina, Irina Valeryevna
    Pustokhin, Denis Alexandrovich
    Gupta, Deepak
    Khanna, Ashish
    Shankar, K.
    Gia Nhu Nguyen
    IEEE ACCESS, 2020, 8 : 107112 - 107123
  • [33] Energy-efficient Incremental Offloading of Neural Network Computations in Mobile Edge Computing
    Guo, Guangfeng
    Zhang, Junxing
    2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [34] Edge computing network resource allocation based on virtual network embedding
    Zhan, Keqiang
    Chen, Ning
    Kumar, Sripathi Venkata Naga Santhosh
    Kibalya, Godfrey
    Zhang, Peiying
    Zhang, Hongxia
    INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2022, 38 (01)
  • [35] Efficient Task Allocation for Computation Offloading in Vehicular Edge Computing
    Zhang, Zheng
    Zeng, Feng
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (06) : 5595 - 5606
  • [36] Deep Reinforcement Learning Based Edge Computing Network Aided Resource Allocation Algorithm for Smart Grid
    Chi, Yingying
    Zhang, Yi
    Liu, Yong
    Zhu, Hailong
    Zheng, Zhe
    Liu, Rui
    Zhang, Peiying
    IEEE ACCESS, 2023, 11 : 6541 - 6550
  • [37] An online dynamic pricing framework for resource allocation in edge computing
    Chen, Sheng
    Chen, Baochao
    Tao, Xiaoyi
    Xie, Xin
    Li, Keqiu
    JOURNAL OF SYSTEMS ARCHITECTURE, 2022, 133
  • [38] Quantization of Deep Neural Networks for Accurate Edge Computing
    Chen, Wentao
    Qiu, Hailong
    Zhuang, Jian
    Zhang, Chutong
    Hu, Yu
    Lu, Qing
    Wang, Tianchen
    Shi, Yiyu
    Huang, Meiping
    Xu, Xiaowe
    ACM JOURNAL ON EMERGING TECHNOLOGIES IN COMPUTING SYSTEMS, 2021, 17 (04)
  • [39] Design and Implementation of a Convolutional Neural Network on an Edge Computing Smartphone for Human Activity Recognition
    Zebin, Tahmina
    Scully, Patricia J.
    Peek, Niels
    Casson, Alexander J.
    Ozanyan, Krikor B.
    IEEE ACCESS, 2019, 7 : 133509 - 133520
  • [40] Joint system consumption minimization and workload allocation in edge-core network
    Wang Shaochun
    Lu Zhaoming
    Wen Xiangming
    Wang Luhan
    Ma Lu
    The Journal of China Universities of Posts and Telecommunications, 2018, 25 (03) : 45 - 54