Energy Minimization in UAV-Aided Networks: Actor-Critic Learning for Constrained Scheduling Optimization

被引:36
作者
Yuan, Yaxiong [1 ]
Lei, Lei [1 ]
Vu, Thang X. [1 ]
Chatzinotas, Symeon [1 ]
Sun, Sumei [2 ]
Ottersten, Bjorn [1 ]
机构
[1] Luxembourg Univ, Interdisciplinary Ctr Secur Reliabil & Trust, L-1855 Kirchberg, Luxembourg
[2] Agcy Sci Technol & Res, Inst Infocomm Res, Singapore 138632, Singapore
关键词
Optimization; Trajectory; Heuristic algorithms; Unmanned aerial vehicles; Resource management; Propulsion; Task analysis; UAV; deep reinforcement learning; user scheduling; hovering time allocation; energy optimization; actor-critic; TRAJECTORY OPTIMIZATION; RESOURCE-ALLOCATION; FAIR COMMUNICATION; EFFICIENT; CHANNEL; DESIGN; RADIO;
D O I
10.1109/TVT.2021.3075860
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In unmanned aerial vehicle (UAV) applications, the UAV's limited energy supply and storage have triggered the development of intelligent energy-conserving scheduling solutions. In this paper, we investigate energy minimization for UAV-aided communication networks by jointly optimizing data-transmission scheduling and UAV hovering time. The formulated problem is combinatorial and non-convex with bilinear constraints. To tackle the problem, firstly, we provide an optimal algorithm (OPT) and a golden section search heuristic algorithm (GSS-HEU). Both solutions are served as offline performance benchmarks which might not be suitable for online operations. Towards this end, from a deep reinforcement learning (DRL) perspective, we propose an actor-critic-based deep stochastic online scheduling (AC-DSOS) algorithm and develop a set of approaches to confine the action space. Compared to conventional RL/DRL, the novelty of AC-DSOS lies in handling two major issues, i.e., exponentially-increased action space and infeasible actions. Numerical results show that AC-DSOS is able to provide feasible solutions, and save around 25-30% energy compared to two conventional deep AC-DRL algorithms. Compared to the developed GSS-HEU, AC-DSOS consumes around 10% higher energy but reduces the computational time from second-level to millisecond-level.
引用
收藏
页码:5028 / 5042
页数:15
相关论文
共 49 条
[21]   Co-deposition of silicon with rare earth elements (REEs) and aluminium in the fern Dicranopteris linearis from China [J].
Liu, Wen-Shen ;
Zheng, Hong-Xiang ;
Guo, Mei-Na ;
Liu, Chang ;
Huot, Hermine ;
Morel, Jean Louis ;
van der Ent, Antony ;
Tang, Ye-Tao ;
Qiu, Rong-Liang .
PLANT AND SOIL, 2019, 437 (1-2) :427-437
[22]  
Lu Y, 2018, AER ADV ENG RES, V176, P1
[23]   COMPUTABILITY OF GLOBAL SOLUTIONS TO FACTORABLE NONCONVEX PROGRAMS .1. CONVEX UNDERESTIMATING PROBLEMS [J].
MCCORMICK, GP .
MATHEMATICAL PROGRAMMING, 1976, 10 (02) :147-175
[24]   A Tutorial on UAVs for Wireless Networks: Applications, Challenges, and Open Problems [J].
Mozaffari, Mohammad ;
Saad, Walid ;
Bennis, Mehdi ;
Nam, Young-Han ;
Debbah, Merouane .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2019, 21 (03) :2334-2360
[25]  
Schulman J., 2016, P INT C LEARN REPR I, P1
[26]  
Shengxiang Zhu, 2018, 2018 IEEE 4th International Conference on Computer and Communications (ICCC). Proceedings, P158, DOI 10.1109/CompComm.2018.8780803
[27]   Drone-Cell Trajectory Planning and Resource Allocation for Highly Mobile Networks: A Hierarchical DRL Approach [J].
Shi, Weisen ;
Li, Junling ;
Wu, Huaqing ;
Zhou, Conghao ;
Cheng, Nan ;
Shen, Xuemin .
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (12) :9800-9813
[28]   Auction-Based Charging Scheduling With Deep Learning Framework for Multi-Drone Networks [J].
Shin, MyungJae ;
Kim, Joongheon ;
Levorato, Marco .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (05) :4235-4248
[29]  
Silver D, 2014, PR MACH LEARN RES, V32
[30]   Energy Efficient Multi-Antenna UAV-Enabled Mobile Relay [J].
Song, Qingheng ;
Zheng, Fuchun .
CHINA COMMUNICATIONS, 2018, 15 (05) :41-50