Joint Optimization of Trajectory and User Association via Reinforcement Learning for UAV-Aided Data Collection in Wireless Networks

Cited by: 22
Authors
Chen, Gong [1, 2, 3]
Zhai, Xiangping Bryce [1, 2]
Li, Congduan [3]
Affiliations
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 211106, Peoples R China
[2] Collaborat Innovat Ctr Novel Software Technol & In, Nanjing 210023, Jiangsu, Peoples R China
[3] Sun Yat Sen Univ, Sch Elect & Commun Engn, Shenzhen 518107, Peoples R China
Funding
U.S. National Science Foundation;
Keywords
Trajectory; Optimization; Games; Throughput; Wireless networks; Resource management; Interference; UAV trajectory design; fair throughputs; energy-efficiency; coalition formation games; multi-agent deep reinforcement learning; ENERGY-EFFICIENT; COMMUNICATION; ALLOCATION; DESIGN; SPECTRUM; MEC;
DOI
10.1109/TWC.2022.3216049
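Chinese Library Classification number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code
0808; 0809;
Abstract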
Unmanned aerial vehicles (UAVs) can serve as aerial base stations for data collection in next-generation wireless networks thanks to their high adaptability and maneuverability. This paper investigates a scenario in which multiple UAVs cooperatively fly over heterogeneous ground users (GUs) and collect data without a central controller. Taking the signal-to-interference-plus-noise ratio (SINR) and fairness among users into account, we jointly optimize the UAV trajectories and the GU associations to maximize total throughput and energy efficiency. We formulate the long-term optimization problem as a decentralized partially observable Markov decision process (DEC-POMDP) and derive an approach that combines a coalition formation game (CFG) with multi-agent deep reinforcement learning (MADRL). We first formulate the discrete association scheduling problem as a non-cooperative game and use the CFG algorithm to obtain a decentralized scheme that converges to a Nash equilibrium (NE). Then, a MADRL-based technique is developed to optimize the trajectories and energy consumption continuously in a centralized-training, decentralized-execution manner. Simulation results demonstrate that the proposed algorithm, while operating in a distributed manner, outperforms commonly used schemes in the literature in terms of fair throughput and energy consumption.
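The association step described in the abstract can be illustrated with a minimal sketch of a switch-rule coalition formation loop. This is not the paper's algorithm. It assumes a toy model in which each UAV splits its bandwidth equally among its associated GUs, inter-UAV interference is ignored, and a switch is accepted only when it raises the sum of log-rates (a proportional-fairness style objective chosen here for illustration). The names `rate`, `total_utility`, `form_coalitions`, and the `gains` matrix are all hypothetical.

```python
import math
import random


def rate(gain, n_users, noise=1e-3):
    """Per-user rate when a UAV shares its bandwidth equally among n_users.

    Assumption: interference between UAVs is ignored, so this uses SNR,
    not the SINR model of the paper.
    """
    return math.log2(1.0 + gain / noise) / n_users


def total_utility(assoc, gains, n_uavs):
    """Sum of log-rates over all users (assumed fairness-aware objective)."""
    load = [0] * n_uavs
    for k in assoc:
        load[k] += 1
    return sum(math.log(rate(gains[u][k], load[k])) for u, k in enumerate(assoc))


def form_coalitions(gains, n_uavs, max_rounds=1000, seed=0):
    """Let users switch UAVs one at a time until no switch helps.

    gains[u][k] is a hypothetical positive channel gain from user u to UAV k.
    A switch is kept only if it strictly increases the total utility, so the
    loop terminates: the number of partitions is finite and the objective
    never decreases.
    """
    rng = random.Random(seed)
    n_users = len(gains)
    assoc = [rng.randrange(n_uavs) for _ in range(n_users)]
    best = total_utility(assoc, gains, n_uavs)
    for _ in range(max_rounds):
        switched = False
        for u in range(n_users):
            for k in range(n_uavs):
                if k == assoc[u]:
                    continue
                old = assoc[u]
                assoc[u] = k  # tentatively move user u to coalition k
                cand = total_utility(assoc, gains, n_uavs)
                if cand > best + 1e-12:
                    best = cand
                    switched = True
                else:
                    assoc[u] = old  # revert: the switch did not help
        if not switched:
            break  # stable partition under this switch rule
    return assoc, best


if __name__ == "__main__":
    gains = [[1.0, 0.1], [0.9, 0.2], [0.1, 1.0], [0.2, 0.8]]
    assoc, best = form_coalitions(gains, n_uavs=2)
    print("association:", assoc, "utility:", round(best, 3))
```

The returned partition is stable in the sense that no single user can change its UAV and increase the chosen objective. A non-cooperative formulation such as the one named in the abstract would instead compare each user's own utility before and after a switch, and would need its own convergence argument.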
Pages: 3128-3143
Number of pages: 16
Related papers
50 in total
  • [31] UAV-Aided Cooperative Data Collection Scheme for Ocean Monitoring Networks
    Ma, Ruofei
    Wang, Ruisong
    Liu, Gongliang
    Meng, Weixiao
    Liu, Xiqing
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (17) : 13222 - 13236
  • [32] Resource Allocation and Trajectory Design in UAV-Aided Cellular Networks Based on Multiagent Reinforcement Learning
    Yin, Sixing
    Yu, F. Richard
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (04) : 2933 - 2943
  • [33] Joint Resource Allocation and Trajectory Optimization With QoS in UAV-Based NOMA Wireless Networks
    Li, Yabo
    Zhang, Haijun
    Long, Keping
    Jiang, Chunxiao
    Guizani, Mohsen
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (10) : 6343 - 6355
  • [34] Multitask Transfer Deep Reinforcement Learning for Timely Data Collection in Rechargeable-UAV-Aided IoT Networks
    Yi, Mengjie
    Wang, Xijun
    Liu, Juan
    Zhang, Yan
    Hou, Ronghui
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (23) : 20545 - 20559
  • [35] Energy Minimization in UAV-Aided Networks: Actor-Critic Learning for Constrained Scheduling Optimization
    Yuan, Yaxiong
    Lei, Lei
    Vu, Thang X.
    Chatzinotas, Symeon
    Sun, Sumei
    Ottersten, Bjorn
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (05) : 5028 - 5042
  • [36] Joint Optimization of Multi-UAV Deployment and User Association via Deep Reinforcement Learning for Long-Term Communication Coverage
    Cheng, Xu
    Jiang, Rong
    Sang, Hongrui
    Li, Gang
    He, Bin
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [37] Deep-Reinforcement-Learning-Based Optimal Transmission Policies for Opportunistic UAV-Aided Wireless Sensor Network
    Liu, Yitong
    Yan, Junjie
    Zhao, Xiaohui
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (15) : 13823 - 13836
  • [38] Multi-Agent Deep Reinforcement Learning for Joint Decoupled User Association and Trajectory Design in Full-Duplex Multi-UAV Networks
    Dai, Chen
    Zhu, Kun
    Hossain, Ekram
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2023, 22 (10) : 6056 - 6070
  • [39] UAV-Aided Dual-User Wireless Power Transfer: 3D Trajectory Design and Energy Optimization
    Gou, Xiaogang
    Sun, Zhaojie
    Huang, Kaiyuan
    SENSORS, 2023, 23 (06)
  • [40] Trajectory optimization for the UAV assisted data collection in wireless sensor networks
    Saxena, Kartik
    Gupta, Nitin
    Gupta, Jahnvi
    Sharma, Deepak Kumar
    Dev, Kapal
    WIRELESS NETWORKS, 2022, 28 (04) : 1785 - 1796