Cost-Driven Off-Loading for DNN-Based Applications Over Cloud, Edge, and End Devices

被引:130
作者
Lin, Bin [1 ,2 ,3 ,4 ]
Huang, Yinhao [5 ,6 ]
Zhang, Jianshan [5 ,6 ]
Hu, Junqin [5 ,6 ]
Chen, Xing [5 ,6 ]
Li, Jun [7 ,8 ]
机构
[1] Fujian Normal Univ, Coll Phys & Energy, Fujian Prov Key Lab Quantum Manipulat & New Energ, Fuzhou 350117, Peoples R China
[2] Fujian Prov Collaborat Innovat Ctr Optoelect Semi, Xiamen 361005, Peoples R China
[3] Fujian Prov Univ, Engn Res Ctr Big Data Applicat Private Hlth Med, Putian 351100, Fujian, Peoples R China
[4] Fujian Prov Collaborat Innovat Ctr Adv High Field, Fuzhou 350117, Fujian, Peoples R China
[5] Fuzhou Univ, Coll Math & Comp Sci, Fuzhou 350118, Peoples R China
[6] Fujian Prov Key Lab Network Comp & Intelligent In, Fuzhou 350118, Peoples R China
[7] Nanjing Univ Sci & Technol, Sch Elect & Opt Engn, Nanjing 210094, Peoples R China
[8] Natl Res Tomsk Polytech Univ, Sch Comp Sci & Robot, Tomsk 634050, Russia
基金
国家重点研发计划;
关键词
Cloud computing; cost-driven off-loading; deep neural networks (DNNs); edge computing; workflow scheduling; PARTICLE SWARM OPTIMIZATION; ALGORITHM;
D O I
10.1109/TII.2019.2961237
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Currently, deep neural networks (DNNs) have achieved a great success in various applications. Traditional deployment for DNNs in the cloud may incur a prohibitively serious delay in transferring input data from the end devices to the cloud. To address this problem, the hybrid computing environments, consisting of the cloud, edge, and end devices, are adopted to offload DNN layers by combining the larger layers (more amount of data) in the cloud and the smaller layers (less amount of data) at the edge and end devices. A key issue in hybrid computing environments is how to minimize the system cost while accomplishing the offloaded layers with their deadline constraints. In this article, a self-adaptive discrete particle swarm optimization (PSO) algorithm using the genetic algorithm (GA) operators is proposed to reduce the system cost caused by data transmission and layer execution. This approach considers the characteristics of DNNs partitioning and layers off-loading over the cloud, edge, and end devices. The mutation operator and crossover operator of GA are adopted to avert the premature convergence of PSO, which distinctly reduces the system cost through enhanced population diversity of PSO. The proposed off-loading strategy is compared with benchmark solutions, and the results show that our strategy can effectively reduce the system cost of off-loading for DNN-based applications over the cloud, edge and end devices relative to the benchmarks.
引用
收藏
页码:5456 / 5466
页数:11
相关论文
共 32 条
[1]  
[Anonymous], 2019, IEEE T CLOUD COMPUT
[2]   A Genetic Algorithm Based Data Replica Placement Strategy for Scientific Applications in Clouds [J].
Cui, Lizhen ;
Zhang, Junhua ;
Yue, Lingxi ;
Shi, Yuliang ;
Li, Hui ;
Yuan, Dong .
IEEE TRANSACTIONS ON SERVICES COMPUTING, 2018, 11 (04) :727-739
[3]   QoS-Aware Scheduling of Heterogeneous Servers for Inference in Deep Neural Networks [J].
Fang, Zhou ;
Yu, Tong ;
Mengshoel, Ole J. ;
Gupta, Rajesh K. .
CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, :2067-2070
[4]   Cost-Driven Scheduling for Deadline-Based Workflow Across Multiple Clouds [J].
Guo, Wenzhong ;
Lin, Bing ;
Chen, Guolong ;
Chen, Yuzhong ;
Liang, Feng .
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2018, 15 (04) :1571-1585
[5]   Effects of Leader Motivating Language on Employee Task and Contextual Performance: The Mediating Role of Feedback Quality [J].
Guo, Yue ;
Lin, Bin .
PSYCHOLOGICAL REPORTS, 2020, 123 (06) :2501-2518
[6]  
Han C, 2016, PROCEEDINGS OF THE 12TH EURO-ASIA CONFERENCE ON ENVIRONMENT AND CSR: TOURISM, SOCIETY AND EDUCATION SESSION, PT II, P123
[7]   Deep-Reinforcement-Learning-Based Optimization for Cache-Enabled Opportunistic Interference Alignment Wireless Networks [J].
He, Ying ;
Zhang, Zheng ;
Yu, F. Richard ;
Zhao, Nan ;
Yin, Hongxi ;
Leung, Victor C. M. ;
Zhang, Yanhua .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2017, 66 (11) :10433-10445
[8]  
Hochba DS., 1997, ACM SIGACT NEWS, V28, P40, DOI DOI 10.1145/261342.571216
[9]  
Hu C, 2019, IEEE INFOCOM SER, P1423, DOI [10.1109/infocom.2019.8737614, 10.1109/INFOCOM.2019.8737614]
[10]  
Jeong HJ, 2019, INT CONF PERVAS COMP, P451, DOI [10.1109/percomw.2019.8730793, 10.1109/PERCOMW.2019.8730793]