Joint Resource Allocation and Trajectory Design for Multi-UAV Systems With Moving Users: Pointer Network and Unfolding

被引:10
|
作者
Hou, Qiushuo [1 ,2 ]
Cai, Yunlong [1 ,2 ]
Hu, Qiyu [1 ,2 ]
Lee, Mengyuan [1 ,2 ]
Yu, Guanding [1 ,2 ]
机构
[1] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China
[2] Zhejiang Prov Key Lab Informat Proc Commun & Netwo, Hangzhou 310027, Peoples R China
基金
中国国家自然科学基金;
关键词
Artificial neural networks; Trajectory; Resource management; Optimization; Communication systems; Autonomous aerial vehicles; Trajectory optimization; Multi-UAV; resource optimization; trajectory design; pointer network; deep reinforcement learning; deep-unfolding; REINFORCEMENT LEARNING APPROACH; NEURAL-NETWORKS; DEEP; MIMO;
D O I
10.1109/TWC.2022.3217176
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
As an important part of the fifth generation (5G) mobile networks, unmanned aerial vehicles (UAVs) have been applied in various communication scenarios due to their high operability and low cost. In this paper, we investigate a multi-UAV communication system with moving users and consider the co-channel interference caused by the transmissions of all other UAVs. To ensure the fairness, we maximize the minimum average user rate during the observed time by jointly optimizing UAVs' trajectories, transmission power, and user association. Considering that UAVs can cover a large area for communications, UAVs do not need to move as soon as the users move. Therefore, a two-timescale structure is proposed for the considered scenario, where the UAVs' trajectories are optimized based on the channel state information (CSI) in a long timescale, while the transmission power and the user association are optimized based on the instantaneous CSI in a short timescale. To effectively tackle this challenging non-convex problem with both discrete and continuous variables, we propose a joint neural network (NN) design, where a deep reinforcement learning based Pointer Network named advantage pointer-critic (APC) is applied to optimize discrete variables and a deep-unfolding NN is used to optimize the continuous variables. Specifically, we first formulate a Markov decision process to model the user association, and then employ the APC network trained by the advantage actor-critic algorithm to address it. The APC network consists of a Pointer Network and a Multilayer Perceptron. As for the deep-unfolding NN, we first develop a block coordinate descent based algorithm to optimize the UAVs' trajectories and transmission power, and then unfold the algorithm into a layer-wise NN with introduced trainable parameters. These two networks are jointly trained in an unsupervised fashion. Simulation results validate that the proposed joint NN significantly outperforms the optimization algorithm with much lower complexity, and achieves good performances on scalability and generalization ability.
引用
收藏
页码:3310 / 3323
页数:14
相关论文
共 50 条
  • [21] Layerwise Quantum Deep Reinforcement Learning for Joint Optimization of UAV Trajectory and Resource Allocation
    Silvirianti
    Narottama, Bhaskara
    Shin, Soo Young
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (01) : 430 - 443
  • [22] DRL-Based Resource Allocation and Trajectory Planning for NOMA-Enabled Multi-UAV Collaborative Caching 6G Network
    Qin, Peng
    Fu, Yang
    Zhang, Jing
    Geng, Suiyan
    Liu, Jiayan
    Zhao, Xiongwen
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (06) : 8750 - 8764
  • [23] Joint Subcarrier and Power Allocation for Multi-UAV Systems
    Xin Guan
    Yang Huang
    Qingjiang Shi
    中国通信, 2019, 16 (01) : 47 - 56
  • [24] Resource Allocation and Trajectory Optimization in Multi-UAV Collaborative Vehicular Networks: An Extended Multiagent DRL Approach
    Zhang, Wenqian
    Tan, Lu
    Huang, Tao
    Huang, Xiaowen
    Huang, Mengting
    Zhang, Guanglin
    IEEE INTERNET OF THINGS JOURNAL, 2025, 12 (08): : 9391 - 9404
  • [25] Deep reinforcement learning based trajectory design and resource allocation for task-aware multi-UAV enabled MEC networks
    Li, Zewu
    Xu, Chen
    Zhang, Zhanpeng
    Wu, Runze
    COMPUTER COMMUNICATIONS, 2024, 213 : 88 - 98
  • [26] Joint Communication and Trajectory Optimization for Multi-UAV Enabled Mobile Internet of Vehicles
    Liu, Xin
    Lai, Biaojun
    Lin, Bin
    Leung, Victor C. M.
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (09) : 15354 - 15366
  • [27] Energy Consumption Minimization in UAV-Assisted Mobile-Edge Computing Systems: Joint Resource Allocation and Trajectory Design
    Ji, Jiequ
    Zhu, Kun
    Yi, Changyan
    Niyato, Dusit
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (10) : 8570 - 8584
  • [28] Joint Task Allocation and Trajectory Optimization for Multi-UAV Collaborative Air-Ground Edge Computing
    Qin, Peng
    Li, Jinghan
    Zhang, Jing
    Fu, Yang
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (06): : 6231 - 6243
  • [29] Joint Resource Allocation and 3D Aerial Trajectory Design for Video Streaming in UAV Communication Systems
    Zhan, Cheng
    Hu, Han
    Sui, Xiufeng
    Liu, Zhi
    Wang, Jianan
    Wang, Honggang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (08) : 3227 - 3241
  • [30] Analytical Optimal Joint Resource Allocation and Continuous Trajectory Design for UAV-Assisted Covert Communications
    Huang, Yuxi
    Hu, Yulin
    Yuan, Xiaopeng
    Schmeink, Anke
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2025, 24 (01) : 213 - 227