Joint Optimization of Trajectory and User Association via Reinforcement Learning for UAV-Aided Data Collection in Wireless Networks

被引:30
作者
Chen, Gong [1 ,2 ,3 ]
Zhai, Xiangping Bryce [1 ,2 ]
Li, Congduan [3 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 211106, Peoples R China
[2] Collaborat Innovat Ctr Novel Software Technol & In, Nanjing 210023, Jiangsu, Peoples R China
[3] Sun Yat Sen Univ, Sch Elect & Commun Engn, Shenzhen 518107, Peoples R China
基金
美国国家科学基金会;
关键词
Trajectory; Optimization; Games; Throughput; Wireless networks; Resource management; Interference; UAV trajectory design; fair throughputs; energy-efficiency; coalition formation games; multi-agent deep reinforcement learning; ENERGY-EFFICIENT; COMMUNICATION; ALLOCATION; DESIGN; SPECTRUM; MEC;
D O I
10.1109/TWC.2022.3216049
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Unmanned Aerial Vehicles (UAVs) can be used as aerial base stations for data collection in next-generation wireless networks due to their high adaptability and maneuverability. This paper investigates the scenario where multiple UAVs cooperatively fly over heterogeneous ground users (GUs) and collect data without a central controller. With the consideration of signal-to-interference-and-noise ratio (SINR) and fairness among users, we jointly optimize the trajectories of UAVs and the GUs associations to maximize the total throughput and energy efficiency. We formulate the long-term optimization problem as a decentralized partially observed Markov decision processes (DEC-POMDP) and derive an approach combining the coalition formation game (CFG) and multi-agent deep reinforcement learning (MADRL). We first formulate the discrete association scheduling problem as a non-cooperative theoretical game and use the CFG algorithm to achieve a decentralized scheme converging to Nash equilibrium (NE). Then, a MARL-based technique is developed to optimize the trajectories and energy consumption continuously in a centralized-training but decentralized-execution manner. Simulation results demonstrate that the proposed algorithm outperforms the commonly used schemes in the literature, regarding the fair throughput and energy consumption in a distributed manner.
引用
收藏
页码:3128 / 3143
页数:16
相关论文
共 39 条
[1]   Internet of Things: A Survey on Enabling Technologies, Protocols, and Applications [J].
Al-Fuqaha, Ala ;
Guizani, Mohsen ;
Mohammadi, Mehdi ;
Aledhari, Mohammed ;
Ayyash, Moussa .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2015, 17 (04) :2347-2376
[2]   Optimal LAP Altitude for Maximum Coverage [J].
Al-Hourani, Akram ;
Kandeepan, Sithamparanathan ;
Lardner, Simon .
IEEE WIRELESS COMMUNICATIONS LETTERS, 2014, 3 (06) :569-572
[3]   Joint Task Assignment and Spectrum Allocation in Heterogeneous UAV Communication Networks: A Coalition Formation Game-Theoretic Approach [J].
Chen, Jiaxin ;
Wu, Qihui ;
Xu, Yuhua ;
Qi, Nan ;
Guan, Xin ;
Zhang, Yuli ;
Xue, Zhen .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (01) :440-452
[4]   Deep Reinforcement Learning for Internet of Things: A Comprehensive Survey [J].
Chen, Wuhui ;
Qiu, Xiaoyu ;
Cai, Ting ;
Dai, Hong-Ning ;
Zheng, Zibin ;
Zhang, Yan .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2021, 23 (03) :1659-1692
[5]   UAV-Relaying-Assisted Secure Transmission With Caching [J].
Cheng, Fen ;
Gui, Guan ;
Zhao, Nan ;
Chen, Yunfei ;
Tang, Jie ;
Sari, Hikmet .
IEEE TRANSACTIONS ON COMMUNICATIONS, 2019, 67 (05) :3140-3153
[6]   Trajectory Design and Access Control for Air-Ground Coordinated Communications System With Multiagent Deep Reinforcement Learning [J].
Ding, Ruijin ;
Xu, Yadong ;
Gao, Feifei ;
Shen, Xuemin .
IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (08) :5785-5798
[7]   3D UAV Trajectory Design and Frequency Band Allocation for Energy-Efficient and Fair Communication: A Deep Reinforcement Learning Approach [J].
Ding, Ruijin ;
Gao, Feifei ;
Shen, Xuemin .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (12) :7796-7809
[8]   Energy-Efficient UAV-Enabled Data Collection via Wireless Charging: A Reinforcement Learning Approach [J].
Fu, Shu ;
Tang, Yujie ;
Wu, Yuan ;
Zhang, Ning ;
Gu, Huaxi ;
Chen, Chen ;
Liu, Min .
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (12) :10209-10219
[9]   Energy-Aware Trajectory Design for Outage Minimization in UAV-Assisted Communication Systems [J].
Gupta, Nishant ;
Mishra, Deepak ;
Agarwal, Satyam .
IEEE TRANSACTIONS ON GREEN COMMUNICATIONS AND NETWORKING, 2022, 6 (03) :1751-1763
[10]  
Ji J., 2022, IEEE Trans. Mob. Comput.