Joint Optimization of Trajectory and User Association via Reinforcement Learning for UAV-Aided Data Collection in Wireless Networks

被引：22

作者：

Chen, Gong ^{[1
,2
,3
]}

Zhai, Xiangping Bryce ^{[1
,2
]}

Li, Congduan ^{[3
]}

机构：

[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 211106, Peoples R China

[2] Collaborat Innovat Ctr Novel Software Technol & In, Nanjing 210023, Jiangsu, Peoples R China

[3] Sun Yat Sen Univ, Sch Elect & Commun Engn, Shenzhen 518107, Peoples R China

来源：

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS | 2023年 / 22卷 / 05期

基金：

美国国家科学基金会;

关键词：

Trajectory; Optimization; Games; Throughput; Wireless networks; Resource management; Interference; UAV trajectory design; fair throughputs; energy-efficiency; coalition formation games; multi-agent deep reinforcement learning; ENERGY-EFFICIENT; COMMUNICATION; ALLOCATION; DESIGN; SPECTRUM; MEC;

D O I：

10.1109/TWC.2022.3216049

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Unmanned Aerial Vehicles (UAVs) can be used as aerial base stations for data collection in next-generation wireless networks due to their high adaptability and maneuverability. This paper investigates the scenario where multiple UAVs cooperatively fly over heterogeneous ground users (GUs) and collect data without a central controller. With the consideration of signal-to-interference-and-noise ratio (SINR) and fairness among users, we jointly optimize the trajectories of UAVs and the GUs associations to maximize the total throughput and energy efficiency. We formulate the long-term optimization problem as a decentralized partially observed Markov decision processes (DEC-POMDP) and derive an approach combining the coalition formation game (CFG) and multi-agent deep reinforcement learning (MADRL). We first formulate the discrete association scheduling problem as a non-cooperative theoretical game and use the CFG algorithm to achieve a decentralized scheme converging to Nash equilibrium (NE). Then, a MARL-based technique is developed to optimize the trajectories and energy consumption continuously in a centralized-training but decentralized-execution manner. Simulation results demonstrate that the proposed algorithm outperforms the commonly used schemes in the literature, regarding the fair throughput and energy consumption in a distributed manner.

引用

页码：3128 / 3143

页数：16

共 50 条

[41] Energy-Efficient Trajectory Design for UAV-Aided Maritime Data Collection in Wind
Zhang, Yifan
Lyu, Jiangbin
Fu, Liqun
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (12) : 10871 - 10886
[42] Trajectory Design for UAV-Based Internet of Things Data Collection: A Deep Reinforcement Learning Approach
Wang, Yang
Gao, Zhen
Zhang, Jun
Cao, Xianbin
Zheng, Dezhi
Gao, Yue
Ng, Derrick Wing Kwan
Di Renzo, Marco
IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (05): : 3899 - 3912
[43] Energy-Efficient UAV-Enabled Data Collection via Wireless Charging: A Reinforcement Learning Approach
Fu, Shu
Tang, Yujie
Wu, Yuan
Zhang, Ning
Gu, Huaxi
Chen, Chen
Liu, Min
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (12) : 10209 - 10219
[44] Reinforcement Learning-Based Trajectory Planning For UAV-aided Vehicular Communications
Marini, Riccardo
Spampinato, Leonardo
Mignardi, Silvia
Verdone, Roberto
Buratti, Chiara
2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 967 - 971
[45] Deep Reinforcement Learning for Joint Trajectory Planning, Transmission Scheduling, and Access Control in UAV-Assisted Wireless Sensor Networks
Luo, Xiaoling
Chen, Che
Zeng, Chunnian
Li, Chengtao
Xu, Jing
Gong, Shimin
SENSORS, 2023, 23 (10)
[46] Energy-Effective Data Gathering for UAV-Aided Wireless Sensor Networks
Liu, Bin
Zhu, Hongbo
SENSORS, 2019, 19 (11):
[47] Joint Optimization of UAV Trajectory and Relay Ratio in UAV-Aided Mobile Edge Computation Network
Zhang, Xinhe
Zhang, Heli
Ji, Hong
Li, Xi
2020 IEEE 31ST ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS (IEEE PIMRC), 2020,
[48] Energy-Efficient Cyclical Trajectory Design for UAV-Aided Maritime Data Collection in Wind
Zhang, Yifan
Lyu, Jiangbin
Fu, Liqun
2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
[49] Deep Reinforcement Learning Approach for Joint Trajectory Design in Multi-UAV IoT Networks
Xu, Shu
Zhan, Xiangyu
Li, Chunguo
Wang, Dongming
Yang, Luxi
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (03) : 3389 - 3394
[50] Joint Optimization of Trajectory and Resource Allocation for Multi-UAV-Enabled Wireless-Powered Communication Networks
Kim, Chaeyeon
Choi, Hyun-Ho
Lee, Kisong
IEEE TRANSACTIONS ON COMMUNICATIONS, 2024, 72 (09) : 5752 - 5764

← 1 2 3 4 5 →