A Dynamic Deep Neural Network Design for Efficient Workload Allocation in Edge Computing

被引:34
|
作者
Lo, Chi [1 ]
Su, Yu-Yi [1 ]
Lee, Chun-Yi [1 ]
Chang, Shih-Chieh [1 ]
机构
[1] Natl Tsing Hua Univ, Dept Comp Sci, 101,Sec 2,Kuang Fu Rd, Hsinchu 30013, Taiwan
来源
2017 IEEE 35TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD) | 2017年
关键词
Deep neural network; workload allocation; edge computing; authentic operation; dynamic network structure;
D O I
10.1109/ICCD.2017.49
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Unreliable communication channels and limited computing resources at the edge end are two primary constraints of battery-powered movable devices, such as autonomous robots and unmanned aerial vehicles (UAVs). The impact is especially severe for those performing deep neural network (DNN) computations. With increasing demand for accuracy, the trend in modern DNN designs is the use of cascaded modularized layers. Implementing a deep network at the edge increases computational workloads and resource occupancy, leading to an increase in battery drain. Using a shallow network and offloading workloads to backbone servers, however, incur significant latency overheads caused by unstable communication channels. Hence, dynamic DNN design techniques for efficient workload allocation are urgently required to manage the amount of workload transmissions while achieving the required accuracy. In this paper, we explore the use of authentic operation (AO) unit and dynamic network structure to enhance DNNs. The AO unit defines a set of stochastic threshold values for different DNN output classes and determines at runtime if an input has to be transferred to backbone servers for further analysis. The dynamic network structure adjusts its depth according to channel availability. Experiments have been comprehensively performed on several well-known DNN models and datasets. Our results show that, on an average, the proposed techniques are able to reduce the amount of transmissions by up to 17% compared to previous methods under the same accuracy requirement.
引用
收藏
页码:273 / 280
页数:8
相关论文
共 50 条
  • [21] Service Allocation/Placement in Multi-Access Edge Computing with Workload Fluctuations
    Panda, Subrat Prasad
    Ray, Kaustabha
    Banerjee, Ansuman
    SERVICE-ORIENTED COMPUTING (ICSOC 2021), 2021, 13121 : 747 - 755
  • [22] A Blockchain Framework for Efficient Resource Allocation in Edge Computing
    Baranwal, Gaurav
    Kumar, Dinesh
    Biswas, Amit
    Yadav, Ravi
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2024, 21 (04): : 3956 - 3970
  • [23] Deep Reinforcement Learning-Based Workload Scheduling for Edge Computing
    Tao Zheng
    Jian Wan
    Jilin Zhang
    Congfeng Jiang
    Journal of Cloud Computing, 11
  • [24] Deep Reinforcement Learning-Based Workload Scheduling for Edge Computing
    Zheng, Tao
    Wan, Jian
    Zhang, Jilin
    Jiang, Congfeng
    JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2022, 11 (01):
  • [25] EasiEdge: A Novel Global Deep Neural Networks Pruning Method for Efficient Edge Computing
    Yu, Fang
    Cui, Li
    Wang, Pengcheng
    Han, Chuanqi
    Huang, Ruoran
    Huang, Xi
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (03): : 1259 - 1271
  • [26] Workload Allocation Mechanism for Minimum Service Delay in Edge Computing-Based Power Internet of Things
    Niu, Xudong
    Shao, Sujie
    Xin, Chen
    Zhou, Jun
    Guo, Shaoyong
    Chen, Xingyu
    Qi, Feng
    IEEE ACCESS, 2019, 7 : 83771 - 83784
  • [27] An Energy-Efficient Deep Neural Network Accelerator Design
    Jung, Jueun
    Lee, Kyuho Jason
    2020 54TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2020, : 272 - 276
  • [28] Deep Unified Model For Face Recognition Based on Convolution Neural Network and Edge Computing
    Khan, Muhammad Zeeshan
    Harous, Saad
    Ul Hassan, Saleet
    Khan, Muhammad Usman Ghani
    Iqbal, Razi
    Mumtaz, Shahid
    IEEE ACCESS, 2019, 7 : 72622 - 72633
  • [29] An Efficient Resource Allocation Scheme With Uncertain Network Status in Edge Computing-Enabled Networks
    Cheng, Yuxia
    Liang, Chengchao
    Chen, Qianbin
    Yu, F. Richard
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2025, 24 (03) : 1249 - 1263
  • [30] A Novel Predictive Model for Edge Computing Resource Scheduling Based on Deep Neural Network
    Gao, Ming
    Cai, Weiwei
    Jiang, Yizhang
    Hu, Wenjun
    Yao, Jian
    Qian, Pengjiang
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2024, 139 (01): : 259 - 277