Energy Minimization in UAV-Aided Networks: Actor-Critic Learning for Constrained Scheduling Optimization

被引:36
作者
Yuan, Yaxiong [1 ]
Lei, Lei [1 ]
Vu, Thang X. [1 ]
Chatzinotas, Symeon [1 ]
Sun, Sumei [2 ]
Ottersten, Bjorn [1 ]
机构
[1] Luxembourg Univ, Interdisciplinary Ctr Secur Reliabil & Trust, L-1855 Kirchberg, Luxembourg
[2] Agcy Sci Technol & Res, Inst Infocomm Res, Singapore 138632, Singapore
关键词
Optimization; Trajectory; Heuristic algorithms; Unmanned aerial vehicles; Resource management; Propulsion; Task analysis; UAV; deep reinforcement learning; user scheduling; hovering time allocation; energy optimization; actor-critic; TRAJECTORY OPTIMIZATION; RESOURCE-ALLOCATION; FAIR COMMUNICATION; EFFICIENT; CHANNEL; DESIGN; RADIO;
D O I
10.1109/TVT.2021.3075860
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In unmanned aerial vehicle (UAV) applications, the UAV's limited energy supply and storage have triggered the development of intelligent energy-conserving scheduling solutions. In this paper, we investigate energy minimization for UAV-aided communication networks by jointly optimizing data-transmission scheduling and UAV hovering time. The formulated problem is combinatorial and non-convex with bilinear constraints. To tackle the problem, firstly, we provide an optimal algorithm (OPT) and a golden section search heuristic algorithm (GSS-HEU). Both solutions are served as offline performance benchmarks which might not be suitable for online operations. Towards this end, from a deep reinforcement learning (DRL) perspective, we propose an actor-critic-based deep stochastic online scheduling (AC-DSOS) algorithm and develop a set of approaches to confine the action space. Compared to conventional RL/DRL, the novelty of AC-DSOS lies in handling two major issues, i.e., exponentially-increased action space and infeasible actions. Numerical results show that AC-DSOS is able to provide feasible solutions, and save around 25-30% energy compared to two conventional deep AC-DRL algorithms. Compared to the developed GSS-HEU, AC-DSOS consumes around 10% higher energy but reduces the computational time from second-level to millisecond-level.
引用
收藏
页码:5028 / 5042
页数:15
相关论文
共 49 条
[41]   3D Trajectory Optimization in Rician Fading for UAV-Enabled Data Harvesting [J].
You, Changsheng ;
Zhang, Rui .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2019, 18 (06) :3192-3207
[42]   Reinforcement Learning-Based Collision Avoidance and Optimal Trajectory Planning in UAV Communication Networks [J].
Hsu, Yu-Hsin ;
Gau, Rung-Hung .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2022, 21 (01) :306-320
[43]  
Yuan YX, 2020, EUR CONF NETW COMMUN, P348, DOI [10.1109/EuCNC48522.2020.9200931, 10.1109/eucnc48522.2020.9200931]
[44]   Matching Based Two-Timescale Resource Allocation for Cooperative D2D Communication [J].
Yuan, Yiling ;
Yang, Tao ;
Hu, Yulin ;
Feng, Hui ;
Hu, Bo .
2019 11TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2019,
[45]   Energy Minimization for Wireless Communication With Rotary-Wing UAV [J].
Zeng, Yong ;
Xu, Jie ;
Zhang, Rui .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2019, 18 (04) :2329-2345
[46]   Spectrum and Energy Efficiency Maximization in UAV-Enabled Mobile Relaying [J].
Zhang, Jingwei ;
Zeng, Yong ;
Zhang, Rui .
2017 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2017,
[47]   Cellular-Enabled UAV Communication: A Connectivity-Constrained Trajectory Optimization Perspective [J].
Zhang, Shuowen ;
Zeng, Yong ;
Zhang, Rui .
IEEE TRANSACTIONS ON COMMUNICATIONS, 2019, 67 (03) :2580-2604
[48]   Computation Rate Maximization in UAV-Enabled Wireless-Powered Mobile-Edge Computing Systems [J].
Zhou, Fuhui ;
Wu, Yongpeng ;
Hu, Rose Qingyang ;
Qian, Yi .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2018, 36 (09) :1927-1941
[49]   Robust Max-Min Fairness Resource Allocation in Sensing-Based Wideband Cognitive Radio With SWIPT: Imperfect Channel Sensing [J].
Zhou, Fuhui ;
Beaulieu, Norman C. ;
Cheng, Julian ;
Chu, Zheng ;
Wang, Yuhao .
IEEE SYSTEMS JOURNAL, 2018, 12 (03) :2361-2372