A Dynamic Deep Neural Network Design for Efficient Workload Allocation in Edge Computing

Cited by: 34

Authors:

Lo, Chi [1]

Su, Yu-Yi [1]

Lee, Chun-Yi [1]

Chang, Shih-Chieh [1]

Affiliations:

[1] Natl Tsing Hua Univ, Dept Comp Sci, 101,Sec 2,Kuang Fu Rd, Hsinchu 30013, Taiwan

Source:

2017 IEEE 35TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD) | 2017

Keywords:

Deep neural network; workload allocation; edge computing; authentic operation; dynamic network structure;

DOI:

10.1109/ICCD.2017.49

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812;

Abstract:

Unreliable communication channels and limited computing resources at the edge end are two primary constraints of battery-powered movable devices, such as autonomous robots and unmanned aerial vehicles (UAVs). The impact is especially severe for devices that perform deep neural network (DNN) computations. With increasing demand for accuracy, the trend in modern DNN designs is the use of cascaded modularized layers. Implementing a deep network at the edge increases computational workloads and resource occupancy, leading to an increase in battery drain. Using a shallow network and offloading workloads to backbone servers, however, incurs significant latency overheads caused by unstable communication channels. Hence, dynamic DNN design techniques for efficient workload allocation are urgently required to manage the amount of workload transmissions while achieving the required accuracy. In this paper, we explore the use of an authentic operation (AO) unit and a dynamic network structure to enhance DNNs. The AO unit defines a set of stochastic threshold values for different DNN output classes and determines at runtime whether an input has to be transferred to backbone servers for further analysis. The dynamic network structure adjusts its depth according to channel availability. Experiments have been comprehensively performed on several well-known DNN models and datasets. Our results show that, on average, the proposed techniques are able to reduce the amount of transmissions by up to 17% compared to previous methods under the same accuracy requirement.
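The runtime decision described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how the AO unit derives or applies its thresholds, so the sketch assumes a simple per-class confidence check on softmax output, and it assumes the edge network runs deeper when the channel is unavailable. The names `should_offload`, `choose_depth`, `class_thresholds`, `shallow_depth`, and `full_depth` are hypothetical.

```python
import math


def softmax(logits):
    """Convert raw scores to probabilities (numerically stable form)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def should_offload(logits, class_thresholds):
    """Return (predicted_class, offload_flag).

    Assumption: an input is sent to the backbone server when the edge
    model's confidence in its predicted class falls below the threshold
    stored for that class. The paper's actual criterion may differ.
    """
    probs = softmax(logits)
    predicted = max(range(len(probs)), key=probs.__getitem__)
    offload = probs[predicted] < class_thresholds[predicted]
    return predicted, offload


def choose_depth(channel_available, shallow_depth, full_depth):
    """Pick how many layers to run on the edge device.

    Assumption: with a usable channel the device runs a shallow network
    and relies on offloading; without one it runs the full depth locally.
    """
    return shallow_depth if channel_available else full_depth
```

Under these assumptions, a confident prediction such as `should_offload([4.0, 0.0, 0.0], [0.9, 0.9, 0.9])` stays on the device, while a near-tie such as `should_offload([1.0, 0.9, 0.0], [0.9, 0.9, 0.9])` is flagged for transfer to the backbone server.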


Pages: 273 - 280

Page count: 8

50 items in total

[21] Service Allocation/Placement in Multi-Access Edge Computing with Workload Fluctuations
Panda, Subrat Prasad
Ray, Kaustabha
Banerjee, Ansuman
SERVICE-ORIENTED COMPUTING (ICSOC 2021), 2021, 13121 : 747 - 755
[22] A Blockchain Framework for Efficient Resource Allocation in Edge Computing
Baranwal, Gaurav
Kumar, Dinesh
Biswas, Amit
Yadav, Ravi
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2024, 21 (04) : 3956 - 3970
[23] Deep Reinforcement Learning-Based Workload Scheduling for Edge Computing
Tao Zheng
Jian Wan
Jilin Zhang
Congfeng Jiang
Journal of Cloud Computing, 11
[24] Deep Reinforcement Learning-Based Workload Scheduling for Edge Computing
Zheng, Tao
Wan, Jian
Zhang, Jilin
Jiang, Congfeng
JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2022, 11 (01)
[25] EasiEdge: A Novel Global Deep Neural Networks Pruning Method for Efficient Edge Computing
Yu, Fang
Cui, Li
Wang, Pengcheng
Han, Chuanqi
Huang, Ruoran
Huang, Xi
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (03) : 1259 - 1271
[26] Workload Allocation Mechanism for Minimum Service Delay in Edge Computing-Based Power Internet of Things
Niu, Xudong
Shao, Sujie
Xin, Chen
Zhou, Jun
Guo, Shaoyong
Chen, Xingyu
Qi, Feng
IEEE ACCESS, 2019, 7 : 83771 - 83784
[27] An Energy-Efficient Deep Neural Network Accelerator Design
Jung, Jueun
Lee, Kyuho Jason
2020 54TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2020 : 272 - 276
[28] Deep Unified Model For Face Recognition Based on Convolution Neural Network and Edge Computing
Khan, Muhammad Zeeshan
Harous, Saad
Ul Hassan, Saleet
Khan, Muhammad Usman Ghani
Iqbal, Razi
Mumtaz, Shahid
IEEE ACCESS, 2019, 7 : 72622 - 72633
[29] An Efficient Resource Allocation Scheme With Uncertain Network Status in Edge Computing-Enabled Networks
Cheng, Yuxia
Liang, Chengchao
Chen, Qianbin
Yu, F. Richard
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2025, 24 (03) : 1249 - 1263
[30] A Novel Predictive Model for Edge Computing Resource Scheduling Based on Deep Neural Network
Gao, Ming
Cai, Weiwei
Jiang, Yizhang
Hu, Wenjun
Yao, Jian
Qian, Pengjiang
CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2024, 139 (01) : 259 - 277
