Joint Optimization of Trajectory and User Association via Reinforcement Learning for UAV-Aided Data Collection in Wireless Networks

被引：22

作者：

Chen, Gong ^{[1
,2
,3
]}

Zhai, Xiangping Bryce ^{[1
,2
]}

Li, Congduan ^{[3
]}

机构：

[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 211106, Peoples R China

[2] Collaborat Innovat Ctr Novel Software Technol & In, Nanjing 210023, Jiangsu, Peoples R China

[3] Sun Yat Sen Univ, Sch Elect & Commun Engn, Shenzhen 518107, Peoples R China

来源：

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS | 2023年 / 22卷 / 05期

基金：

美国国家科学基金会;

关键词：

Trajectory; Optimization; Games; Throughput; Wireless networks; Resource management; Interference; UAV trajectory design; fair throughputs; energy-efficiency; coalition formation games; multi-agent deep reinforcement learning; ENERGY-EFFICIENT; COMMUNICATION; ALLOCATION; DESIGN; SPECTRUM; MEC;

D O I：

10.1109/TWC.2022.3216049

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Unmanned Aerial Vehicles (UAVs) can be used as aerial base stations for data collection in next-generation wireless networks due to their high adaptability and maneuverability. This paper investigates the scenario where multiple UAVs cooperatively fly over heterogeneous ground users (GUs) and collect data without a central controller. With the consideration of signal-to-interference-and-noise ratio (SINR) and fairness among users, we jointly optimize the trajectories of UAVs and the GUs associations to maximize the total throughput and energy efficiency. We formulate the long-term optimization problem as a decentralized partially observed Markov decision processes (DEC-POMDP) and derive an approach combining the coalition formation game (CFG) and multi-agent deep reinforcement learning (MADRL). We first formulate the discrete association scheduling problem as a non-cooperative theoretical game and use the CFG algorithm to achieve a decentralized scheme converging to Nash equilibrium (NE). Then, a MARL-based technique is developed to optimize the trajectories and energy consumption continuously in a centralized-training but decentralized-execution manner. Simulation results demonstrate that the proposed algorithm outperforms the commonly used schemes in the literature, regarding the fair throughput and energy consumption in a distributed manner.

引用

页码：3128 / 3143

页数：16

共 50 条

[21] Energy-Efficient and Fast Data Collection in UAV-Aided Wireless Sensor Networks for Hilly Terrains
Nazib, Rezoan Ahmed
Moh, Sangman
IEEE ACCESS, 2021, 9 : 23168 - 23190
[22] Joint Flight Cruise Control and Data Collection in UAV-Aided Internet of Things: An Onboard Deep Reinforcement Learning Approach
Li, Kai
Ni, Wei
Tovar, Eduardo
Guizani, Mohsen
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (12) : 9787 - 9799
[23] Joint 3D Trajectory and Power Optimization for UAV-Aided mmWave MIMO-NOMA Networks
Feng, Wanmei
Zhao, Nan
Ao, Shaopeng
Tang, Jie
Zhang, Xiuyin
Fu, Yuli
So, Daniel Ka Chun
Wong, Kai-Kit
IEEE TRANSACTIONS ON COMMUNICATIONS, 2021, 69 (04) : 2346 - 2358
[24] UAV-Aided Wireless Power Transfer and Data Collection in Rician Fading
Liu, Yuan
Xiong, Ke
Lu, Yang
Ni, Qiang
Fan, Pingyi
Ben Letaief, Khaled
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2021, 39 (10) : 3097 - 3113
[25] Joint Optimization of UAV Trajectory and Sensor Uploading Powers for UAV-Assisted Data Collection in Wireless Sensor Networks
Wang, Yinlu
Chen, Ming
Pan, Cunhua
Wang, Kezhi
Pan, Yijin
IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (13) : 11214 - 11226
[26] Energy-Efficient 3D Trajectory Optimization for UAV-Aided Wireless Sensor Networks
Ma, Yue
Tang, Yanqun
Mao, Zhongjun
Zhang, Di
Yang, Chao
Li, Wei
IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 6591 - 6596
[27] Joint flight scheduling and task allocation for secure data collection in UAV-aided IoTs
Wang, Zuyan
Tao, Jun
Gao, Yang
Xu, Yifan
Sun, Weice
Gao, Yu
Li, Wenqiang
COMPUTER NETWORKS, 2022, 207
[28] A Deep Reinforcement Learning Approach to Energy-harvesting UAV-aided Data Collection
Zhang, Ning
Liu, Juan
Xie, Lingfu
Tong, Peng
2020 12TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2020, : 93 - 98
[29] Joint User Scheduling, Power Configuration and Trajectory Planning Strategy for UAV-Aided WSNs
Wang, Xindi
Liu, Xinyu
Wu, Jianjian
Ju, Wei
Chen, Xiaojing
Shen, Ling
ACM TRANSACTIONS ON SENSOR NETWORKS, 2023, 19 (01)
[30] Joint Optimization of Trajectory Control, Resource Allocation, and User Association Based on DRL for Multi-Fixed-Wing UAV Networks
Yin, Baolin
Fang, Xuming
Wang, Xianbin
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (10) : 13330 - 13343

← 1 2 3 4 5 →