Joint Optimization of Trajectory and User Association via Reinforcement Learning for UAV-Aided Data Collection in Wireless Networks

被引:22
作者
Chen, Gong [1 ,2 ,3 ]
Zhai, Xiangping Bryce [1 ,2 ]
Li, Congduan [3 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 211106, Peoples R China
[2] Collaborat Innovat Ctr Novel Software Technol & In, Nanjing 210023, Jiangsu, Peoples R China
[3] Sun Yat Sen Univ, Sch Elect & Commun Engn, Shenzhen 518107, Peoples R China
基金
美国国家科学基金会;
关键词
Trajectory; Optimization; Games; Throughput; Wireless networks; Resource management; Interference; UAV trajectory design; fair throughputs; energy-efficiency; coalition formation games; multi-agent deep reinforcement learning; ENERGY-EFFICIENT; COMMUNICATION; ALLOCATION; DESIGN; SPECTRUM; MEC;
D O I
10.1109/TWC.2022.3216049
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Unmanned Aerial Vehicles (UAVs) can be used as aerial base stations for data collection in next-generation wireless networks due to their high adaptability and maneuverability. This paper investigates the scenario where multiple UAVs cooperatively fly over heterogeneous ground users (GUs) and collect data without a central controller. With the consideration of signal-to-interference-and-noise ratio (SINR) and fairness among users, we jointly optimize the trajectories of UAVs and the GUs associations to maximize the total throughput and energy efficiency. We formulate the long-term optimization problem as a decentralized partially observed Markov decision processes (DEC-POMDP) and derive an approach combining the coalition formation game (CFG) and multi-agent deep reinforcement learning (MADRL). We first formulate the discrete association scheduling problem as a non-cooperative theoretical game and use the CFG algorithm to achieve a decentralized scheme converging to Nash equilibrium (NE). Then, a MARL-based technique is developed to optimize the trajectories and energy consumption continuously in a centralized-training but decentralized-execution manner. Simulation results demonstrate that the proposed algorithm outperforms the commonly used schemes in the literature, regarding the fair throughput and energy consumption in a distributed manner.
引用
收藏
页码:3128 / 3143
页数:16
相关论文
共 50 条
  • [1] Joint User Scheduling and UAV Trajectory Design on Completion Time Minimization for UAV-Aided Data Collection
    Yuan, Xiaopeng
    Hu, Yulin
    Zhang, Jian
    Schmeink, Anke
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (06) : 3884 - 3898
  • [2] Energy-Efficient Multidimensional Trajectory of UAV-Aided IoT Networks With Reinforcement Learning
    Silvirianti
    Shin, Soo Young
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (19): : 19214 - 19226
  • [3] Joint Trajectory and Resource Optimization for UAV-Aided Two-Way Relay Networks
    Liang, Yipeng
    Xiao, Lin
    Yang, Dingcheng
    Liu, Yuanwei
    Zhang, Tiankui
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (01) : 639 - 652
  • [4] Joint Optimization of Trajectory and Node Access in UAV-Aided Data Collection System
    Han, Dongsheng
    Shi, Tianhao
    Han, Tianyu
    Zhou, Zhenyu
    IEEE SYSTEMS JOURNAL, 2023, 17 (02): : 2574 - 2585
  • [5] Joint Cluster Head Selection and Trajectory Planning in UAV-Aided IoT Networks by Reinforcement Learning With Sequential Model
    Zhu, Botao
    Bedeer, Ebrahim
    Nguyen, Ha H.
    Barton, Robert
    Henry, Jerome
    IEEE INTERNET OF THINGS JOURNAL, 2021, 9 (14): : 12071 - 12084
  • [6] Joint Trajectory Design and Radio Resource Management for UAV-Aided Vehicular Networks
    Spampinato, Leonardo
    Ferretti, Danila
    Buratti, Chiara
    Marini, Riccardo
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2025, 74 (01) : 847 - 860
  • [7] UAV-Aided Wireless Powered Communication Networks: Trajectory Optimization and Resource Allocation for Minimum Throughput Maximization
    Park, Junhee
    Lee, Hoon
    Eom, Subin
    Lee, Inkyu
    IEEE ACCESS, 2019, 7 : 134978 - 134991
  • [8] Deep Reinforcement Learning Based Trajectory Design for Customized UAV-Aided NOMA Data Collection
    Zhang, Lei
    Zhang, Yuandi
    Lu, Jiawangnan
    Xiao, Yunfa
    Zhang, Guanglin
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2024, 13 (12) : 3365 - 3369
  • [9] 3D UAV Trajectory and Data Collection Optimisation Via Deep Reinforcement Learning
    Nguyen, Khoi Khac
    Duong, Trung Q.
    Tan Do-Duy
    Claussen, Holger
    Hanzo, Lajos
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2022, 70 (04) : 2358 - 2371
  • [10] Joint Optimization on Trajectory, Altitude, Velocity, and Link Scheduling for Minimum Mission Time in UAV-Aided Data Collection
    Li, Jiaxun
    Zhao, Haitao
    Wang, Haijun
    Gu, Fanglin
    Wei, Jibo
    Yin, Hao
    Ren, Baoquan
    IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (02) : 1464 - 1475