Joint Optimization of Trajectory and User Association via Reinforcement Learning for UAV-Aided Data Collection in Wireless Networks

Cited by: 22
Authors
Chen, Gong [1, 2, 3]
Zhai, Xiangping Bryce [1, 2]
Li, Congduan [3]
Affiliations
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 211106, Peoples R China
[2] Collaborat Innovat Ctr Novel Software Technol & In, Nanjing 210023, Jiangsu, Peoples R China
[3] Sun Yat Sen Univ, Sch Elect & Commun Engn, Shenzhen 518107, Peoples R China
Funding
U.S. National Science Foundation;
Keywords
Trajectory; Optimization; Games; Throughput; Wireless networks; Resource management; Interference; UAV trajectory design; fair throughputs; energy-efficiency; coalition formation games; multi-agent deep reinforcement learning; ENERGY-EFFICIENT; COMMUNICATION; ALLOCATION; DESIGN; SPECTRUM; MEC;
DOI
10.1109/TWC.2022.3216049
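Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification code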
0808 ; 0809 ;
Abstract
Unmanned aerial vehicles (UAVs) can serve as aerial base stations for data collection in next-generation wireless networks owing to their high adaptability and maneuverability. This paper investigates a scenario in which multiple UAVs cooperatively fly over heterogeneous ground users (GUs) and collect data without a central controller. Taking into account the signal-to-interference-plus-noise ratio (SINR) and fairness among users, we jointly optimize the UAV trajectories and the GU associations to maximize total throughput and energy efficiency. We formulate the long-term optimization problem as a decentralized partially observable Markov decision process (DEC-POMDP) and derive an approach that combines a coalition formation game (CFG) with multi-agent deep reinforcement learning (MADRL). We first formulate the discrete association scheduling problem as a non-cooperative game and use the CFG algorithm to obtain a decentralized scheme that converges to a Nash equilibrium (NE). Then, an MADRL-based technique is developed to continuously optimize the trajectories and energy consumption in a centralized-training, decentralized-execution manner. Simulation results demonstrate that the proposed distributed algorithm outperforms schemes commonly used in the literature in terms of fair throughput and energy consumption.
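The abstract names SINR, throughput, and fairness among users as the quantities being balanced, but this record does not give their formulas. As a minimal sketch only, assuming a Shannon-rate throughput model and Jain's fairness index (neither is confirmed by this record, and the paper may define them differently), the quantities could be computed as follows. All function and parameter names are hypothetical.

```python
import math


def sinr(signal_power, interference_powers, noise_power):
    """Signal-to-interference-plus-noise ratio (linear scale, not dB)."""
    return signal_power / (sum(interference_powers) + noise_power)


def shannon_rate(bandwidth_hz, sinr_value):
    """Achievable rate in bit/s under the assumed Shannon capacity model."""
    return bandwidth_hz * math.log2(1.0 + sinr_value)


def jain_fairness(throughputs):
    """Jain's fairness index: (sum x)^2 / (n * sum x^2).

    Assumed fairness measure for illustration; returns 0.0 when all
    throughputs are zero to avoid dividing by zero.
    """
    total = sum(throughputs)
    sum_of_squares = sum(x * x for x in throughputs)
    if sum_of_squares == 0:
        return 0.0
    return (total * total) / (len(throughputs) * sum_of_squares)


if __name__ == "__main__":
    # One GU served by a UAV while two other UAVs interfere (made-up powers).
    gamma = sinr(signal_power=3.0, interference_powers=[1.0, 1.0], noise_power=1.0)
    rate = shannon_rate(bandwidth_hz=1e6, sinr_value=gamma)
    print(gamma, rate, jain_fairness([rate, rate / 2.0]))
```

Under this assumed index, equal throughputs across all GUs give a value of 1, while a single GU receiving all the throughput among n users gives 1/n, which is why such an index is a common way to fold fairness into a throughput objective.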
Pages: 3128 - 3143
Number of pages: 16
Related papers
50 in total
  • [21] Energy-Efficient and Fast Data Collection in UAV-Aided Wireless Sensor Networks for Hilly Terrains
    Nazib, Rezoan Ahmed
    Moh, Sangman
    IEEE ACCESS, 2021, 9 : 23168 - 23190
  • [22] Joint Flight Cruise Control and Data Collection in UAV-Aided Internet of Things: An Onboard Deep Reinforcement Learning Approach
    Li, Kai
    Ni, Wei
    Tovar, Eduardo
    Guizani, Mohsen
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (12) : 9787 - 9799
  • [23] Joint 3D Trajectory and Power Optimization for UAV-Aided mmWave MIMO-NOMA Networks
    Feng, Wanmei
    Zhao, Nan
    Ao, Shaopeng
    Tang, Jie
    Zhang, Xiuyin
    Fu, Yuli
    So, Daniel Ka Chun
    Wong, Kai-Kit
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2021, 69 (04) : 2346 - 2358
  • [24] UAV-Aided Wireless Power Transfer and Data Collection in Rician Fading
    Liu, Yuan
    Xiong, Ke
    Lu, Yang
    Ni, Qiang
    Fan, Pingyi
    Ben Letaief, Khaled
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2021, 39 (10) : 3097 - 3113
  • [25] Joint Optimization of UAV Trajectory and Sensor Uploading Powers for UAV-Assisted Data Collection in Wireless Sensor Networks
    Wang, Yinlu
    Chen, Ming
    Pan, Cunhua
    Wang, Kezhi
    Pan, Yijin
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (13) : 11214 - 11226
  • [26] Energy-Efficient 3D Trajectory Optimization for UAV-Aided Wireless Sensor Networks
    Ma, Yue
    Tang, Yanqun
    Mao, Zhongjun
    Zhang, Di
    Yang, Chao
    Li, Wei
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 6591 - 6596
  • [27] Joint flight scheduling and task allocation for secure data collection in UAV-aided IoTs
    Wang, Zuyan
    Tao, Jun
    Gao, Yang
    Xu, Yifan
    Sun, Weice
    Gao, Yu
    Li, Wenqiang
    COMPUTER NETWORKS, 2022, 207
  • [28] A Deep Reinforcement Learning Approach to Energy-harvesting UAV-aided Data Collection
    Zhang, Ning
    Liu, Juan
    Xie, Lingfu
    Tong, Peng
    2020 12TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2020, : 93 - 98
  • [29] Joint User Scheduling, Power Configuration and Trajectory Planning Strategy for UAV-Aided WSNs
    Wang, Xindi
    Liu, Xinyu
    Wu, Jianjian
    Ju, Wei
    Chen, Xiaojing
    Shen, Ling
    ACM TRANSACTIONS ON SENSOR NETWORKS, 2023, 19 (01)
  • [30] Joint Optimization of Trajectory Control, Resource Allocation, and User Association Based on DRL for Multi-Fixed-Wing UAV Networks
    Yin, Baolin
    Fang, Xuming
    Wang, Xianbin
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (10) : 13330 - 13343