UAV Trajectory, User Association, and Power Control for Multi-UAV-Enabled Energy-Harvesting Communications: Offline Design and Online Reinforcement Learning

被引:11
作者
Fu, Chien-Wei [1 ]
Ku, Meng-Lin [1 ]
Chen, Yu-Jia [1 ]
Quek, Tony Q. S. [2 ]
机构
[1] Natl Cent Univ, Dept Commun Engn, Taoyuan 32001, Taiwan
[2] Singapore Univ Technol & Design, Informat Syst Technol & Design Pillar, Singapore 487372, Singapore
关键词
Autonomous aerial vehicles; Trajectory; Power control; Wireless communication; Resource management; Optimization; Interference; Convex optimization; energy harvesting (EH); power control; reinforcement learning (RL); unmanned aerial vehicle (UAV) trajectory; UAV communication; user association; OPTIMIZATION; ALLOCATION; PERFORMANCE; NETWORKS; ALTITUDE; SKY;
D O I
10.1109/JIOT.2023.3325841
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, we consider multiple solar-powered wireless nodes (WNs) which utilize the harvested solar energy to transmit collected data to multiple unmanned aerial vehicles (UAVs) in the uplink. In this context, we jointly design UAV flight trajectories, UAV-node user association, and uplink power control to effectively utilize the harvested energy and manage co-channel interference within a finite time horizon. The design goal is to ensure the fairness of WNs by maximizing the worst user rate. The joint design problem is highly nonconvex and requires causal (future) knowledge of the instantaneous energy state information (ESI) and channel state information (CSI), which are difficult to predict in reality. To overcome these challenges, we propose an offline method based on convex optimization that only utilizes the average ESI and CSI, where line-of-sight (LOS) and non-LOS (NLOS) channels are considered. The problem is solved by three convex subproblems with successive convex approximation (SCA) and alternative optimization. We further design an online convex-assisted reinforcement learning (CARL) method based on real-time environmental information. An idea of multi-UAV regulated flight corridors, based on the optimal offline UAV trajectories, is proposed to avoid unnecessary flight exploration by UAVs and enables us to improve the learning efficiency and system performance, as compared with the conventional reinforcement learning (RL) method. Computer simulations are used to verify the effectiveness of the proposed methods. The proposed CARL method provides 25% and 12% improvement on the worst user rate over the offline and conventional RL methods.
引用
收藏
页码:9781 / 9800
页数:20
相关论文
共 42 条
[1]   Reinforcement Learning for Virtual Network Embedding in Wireless Sensor Networks [J].
Afifi, Haitham ;
Karl, Holger .
2020 16TH INTERNATIONAL CONFERENCE ON WIRELESS AND MOBILE COMPUTING, NETWORKING AND COMMUNICATIONS (WIMOB), 2020,
[2]   Optimal LAP Altitude for Maximum Coverage [J].
Al-Hourani, Akram ;
Kandeepan, Sithamparanathan ;
Lardner, Simon .
IEEE WIRELESS COMMUNICATIONS LETTERS, 2014, 3 (06) :569-572
[3]   Dynamic Aerial Wireless Power Transfer Optimization [J].
Ali, Muntadher A. ;
Jamalipour, Abbas .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (04) :4010-4022
[4]   Mobility in the Sky: Performance and Mobility Analysis for Cellular-Connected UAVs [J].
Amer, Ramy ;
Saad, Walid ;
Marchetti, Nicola .
IEEE TRANSACTIONS ON COMMUNICATIONS, 2020, 68 (05) :3229-3246
[5]  
Ben-Tal A., 2001, LECT MODERN CONVEX O
[6]  
Boyd S., 2009, Convex Optimization
[7]   Power Outage Estimation and Resource Dimensioning for Solar Powered Cellular Base Stations [J].
Chamola, Vinay ;
Sikdar, Biplab .
IEEE TRANSACTIONS ON COMMUNICATIONS, 2016, 64 (12) :5278-5289
[8]   UAV-Aided Information and Energy Transmissions for Cognitive and Sustainable 5G Networks [J].
Che, Yueling ;
Lai, Yabin ;
Luo, Sheng ;
Wu, Kaishun ;
Duan, Lingjie .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (03) :1668-1683
[9]   Multi-UAV Coverage Scheme for Average Capacity Maximization [J].
Chen, Ruirui ;
Li, Xinyan ;
Sun, Yanjing ;
Li, Song ;
Sun, Zhi .
IEEE COMMUNICATIONS LETTERS, 2020, 24 (03) :653-657
[10]   Learn-As-You-Fly: A Distributed Algorithm for Joint 3D Placement and User Association in Multi-UAVs Networks [J].
El Hammouti, Hajar ;
Benjillali, Mustapha ;
Shihada, Basem ;
Alouini, Mohamed-Slim .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2019, 18 (12) :5831-5844