Joint Optimization of Trajectory and User Association via Reinforcement Learning for UAV-Aided Data Collection in Wireless Networks

Cited by: 22

Authors:

Chen, Gong ^{[1,2,3]}

Zhai, Xiangping Bryce ^{[1,2]}

Li, Congduan ^{[3]}

Affiliations:

[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 211106, Peoples R China

[2] Collaborat Innovat Ctr Novel Software Technol & In, Nanjing 210023, Jiangsu, Peoples R China

[3] Sun Yat Sen Univ, Sch Elect & Commun Engn, Shenzhen 518107, Peoples R China

Source:

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS | 2023 / Vol. 22 / Issue 05

Funding:

U.S. National Science Foundation;

Keywords:

Trajectory; Optimization; Games; Throughput; Wireless networks; Resource management; Interference; UAV trajectory design; fair throughputs; energy-efficiency; coalition formation games; multi-agent deep reinforcement learning; ENERGY-EFFICIENT; COMMUNICATION; ALLOCATION; DESIGN; SPECTRUM; MEC;

DOI:

10.1109/TWC.2022.3216049

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronics and communication technology];

Subject classification code:

0808 ; 0809 ;

Abstract:

Unmanned aerial vehicles (UAVs) can serve as aerial base stations for data collection in next-generation wireless networks because of their high adaptability and maneuverability. This paper investigates a scenario in which multiple UAVs cooperatively fly over heterogeneous ground users (GUs) and collect data without a central controller. Taking the signal-to-interference-plus-noise ratio (SINR) and fairness among users into account, we jointly optimize the UAV trajectories and the GU associations to maximize total throughput and energy efficiency. We formulate the long-term optimization problem as a decentralized partially observable Markov decision process (DEC-POMDP) and derive an approach that combines a coalition formation game (CFG) with multi-agent deep reinforcement learning (MADRL). We first formulate the discrete association scheduling problem as a non-cooperative game and use the CFG algorithm to obtain a decentralized scheme that converges to a Nash equilibrium (NE). Then, a MADRL-based technique is developed to optimize the trajectories and energy consumption continuously in a centralized-training, decentralized-execution manner. Simulation results demonstrate that the proposed distributed algorithm outperforms commonly used schemes from the literature in terms of fair throughput and energy consumption.

Pages: 3128-3143

Page count: 16

50 records in total

[31] UAV-Aided Cooperative Data Collection Scheme for Ocean Monitoring Networks
Ma, Ruofei
Wang, Ruisong
Liu, Gongliang
Meng, Weixiao
Liu, Xiqing
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (17) : 13222 - 13236
[32] Resource Allocation and Trajectory Design in UAV-Aided Cellular Networks Based on Multiagent Reinforcement Learning
Yin, Sixing
Yu, F. Richard
IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (04) : 2933 - 2943
[33] Joint Resource Allocation and Trajectory Optimization With QoS in UAV-Based NOMA Wireless Networks
Li, Yabo
Zhang, Haijun
Long, Keping
Jiang, Chunxiao
Guizani, Mohsen
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (10) : 6343 - 6355
[34] Multitask Transfer Deep Reinforcement Learning for Timely Data Collection in Rechargeable-UAV-Aided IoT Networks
Yi, Mengjie
Wang, Xijun
Liu, Juan
Zhang, Yan
Hou, Ronghui
IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (23) : 20545 - 20559
[35] Energy Minimization in UAV-Aided Networks: Actor-Critic Learning for Constrained Scheduling Optimization
Yuan, Yaxiong
Lei, Lei
Vu, Thang X.
Chatzinotas, Symeon
Sun, Sumei
Ottersten, Bjorn
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (05) : 5028 - 5042
[36] Joint Optimization of Multi-UAV Deployment and User Association via Deep Reinforcement Learning for Long-Term Communication Coverage
Cheng, Xu
Jiang, Rong
Sang, Hongrui
Li, Gang
He, Bin
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
[37] Deep-Reinforcement-Learning-Based Optimal Transmission Policies for Opportunistic UAV-Aided Wireless Sensor Network
Liu, Yitong
Yan, Junjie
Zhao, Xiaohui
IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (15) : 13823 - 13836
[38] Multi-Agent Deep Reinforcement Learning for Joint Decoupled User Association and Trajectory Design in Full-Duplex Multi-UAV Networks
Dai, Chen
Zhu, Kun
Hossain, Ekram
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2023, 22 (10) : 6056 - 6070
[39] UAV-Aided Dual-User Wireless Power Transfer: 3D Trajectory Design and Energy Optimization
Gou, Xiaogang
Sun, Zhaojie
Huang, Kaiyuan
SENSORS, 2023, 23 (06)
[40] Trajectory optimization for the UAV assisted data collection in wireless sensor networks
Saxena, Kartik
Gupta, Nitin
Gupta, Jahnvi
Sharma, Deepak Kumar
Dev, Kapal
WIRELESS NETWORKS, 2022, 28 (04) : 1785 - 1796