A3C-DO: A Regional Resource Scheduling Framework Based on Deep Reinforcement Learning in Edge Scenario

被引:64
作者
Zou, Junfeng [1 ]
Hao, Tongbo [1 ]
Yu, Chen [1 ]
Jin, Hai [1 ]
机构
[1] Huazhong Univ Sci & Technol, Serv Comp Technol & Syst Lab, Big Data Technol & Syst Lab, Natl Engn Res Ctr,Cluster & GridComp Lab,Sch Comp, Wuhan 430074, Peoples R China
关键词
Edge computing; resource scheduling; computation offloading; deep reinforcement learning; wireless communication; MOBILE; ALLOCATION;
D O I
10.1109/TC.2020.2987567
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Currently, huge amounts of data are produced by edge device. Considering the heavy burden of network bandwidth and the service delay requirements of delay-sensitive applications, processing the data at network edge is a great choice. However, edge devices such as smart wearables, connected and autonomous vehicles usually have several limitations on computational capacity and energy which will influence the quality of service. As an effective and efficient strategy, offloading is widely used to address this issue. But when facing device heterogeneity problem and task complexity increase, service quality degradation and resource utility decrease often occur due to unreasonable task distribution. Since conventional simplex offloading strategies show limited performance in complex environment, we are motivated to design a dynamic regional resource scheduling framework which is able to work effectively taking different indexes into consideration. Thus, in this article we first propose a double offloading framework to simulate the offloading process in real edge scenario which consists of different edge servers and devices. Then we formulate the offloading as a Markov Decision Process (MDP) and utilize a deep reinforcement learning (DRL) algorithm named asynchronous advantage actor-critic (A3C) as the offloading decision making strategy to balance the workload of edge servers and finally reduce the overhead in terms of energy and time. Comparison experiments for local computing and wide-used DRL algorithm DQN are conducted in a comprehensive benchmark and the results show that our work performs much better on self-adjusting and overhead reduction.
引用
收藏
页码:228 / 239
页数:12
相关论文
共 27 条
[1]   HD Live Maps for Automated Driving: An AI Approach [J].
Chen, Xin .
26TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2018), 2018, :1-1
[2]   An adaptive offloading framework for Android applications in mobile edge computing [J].
Chen, Xing ;
Chen, Shihong ;
Ma, Yun ;
Liu, Bichun ;
Zhang, Ying ;
Huang, Gang .
SCIENCE CHINA-INFORMATION SCIENCES, 2019, 62 (08)
[3]   Decentralized Computation Offloading Game for Mobile Cloud Computing [J].
Chen, Xu .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2015, 26 (04) :974-983
[4]  
Cuervo E., 2010, Proceedings of the 8th international conference on Mobile systems, applications, and services (MobiSys), P49, DOI [10.1145/1814433.1814441, DOI 10.1145/1814433.1814441]
[5]   Energy-Efficient Dynamic Computation Offloading and Cooperative Task Scheduling in Mobile Cloud Computing [J].
Guo, Songtao ;
Liu, Jiadi ;
Yang, Yuanyuan ;
Xiao, Bin ;
Li, Zhetao .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2019, 18 (02) :319-333
[6]  
Holden J., 2016, Fast-forwarding to a future of on-demand urban air transportation
[7]   A Dynamic Offloading Algorithm for Mobile Computing [J].
Huang, Dong ;
Wang, Ping ;
Niyato, Dusit .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2012, 11 (06) :1991-1995
[8]   Deep Reinforcement Learning for Online Computation Offloading in Wireless Powered Mobile-Edge Computing Networks [J].
Huang, Liang ;
Bi, Suzhi ;
Zhang, Ying-Jun Angela .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2020, 19 (11) :2581-2593
[9]   Computation Offloading for Machine Learning Web Apps in the Edge Server Environment [J].
Jeong, Hyuk-Jin ;
Jeong, InChang ;
Lee, Hyeon-Jae ;
Moon, Soo-Mook .
2018 IEEE 38TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS), 2018, :1492-1499
[10]  
Khandekar Aamod, 2010, 2010 European Wireless Conference (EW), P978, DOI 10.1109/EW.2010.5483516