Joint Resource Allocation and Trajectory Design for Multi-UAV Systems With Moving Users: Pointer Network and Unfolding

被引:10
|
作者
Hou, Qiushuo [1 ,2 ]
Cai, Yunlong [1 ,2 ]
Hu, Qiyu [1 ,2 ]
Lee, Mengyuan [1 ,2 ]
Yu, Guanding [1 ,2 ]
机构
[1] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China
[2] Zhejiang Prov Key Lab Informat Proc Commun & Netwo, Hangzhou 310027, Peoples R China
基金
中国国家自然科学基金;
关键词
Artificial neural networks; Trajectory; Resource management; Optimization; Communication systems; Autonomous aerial vehicles; Trajectory optimization; Multi-UAV; resource optimization; trajectory design; pointer network; deep reinforcement learning; deep-unfolding; REINFORCEMENT LEARNING APPROACH; NEURAL-NETWORKS; DEEP; MIMO;
D O I
10.1109/TWC.2022.3217176
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
As an important part of the fifth generation (5G) mobile networks, unmanned aerial vehicles (UAVs) have been applied in various communication scenarios due to their high operability and low cost. In this paper, we investigate a multi-UAV communication system with moving users and consider the co-channel interference caused by the transmissions of all other UAVs. To ensure the fairness, we maximize the minimum average user rate during the observed time by jointly optimizing UAVs' trajectories, transmission power, and user association. Considering that UAVs can cover a large area for communications, UAVs do not need to move as soon as the users move. Therefore, a two-timescale structure is proposed for the considered scenario, where the UAVs' trajectories are optimized based on the channel state information (CSI) in a long timescale, while the transmission power and the user association are optimized based on the instantaneous CSI in a short timescale. To effectively tackle this challenging non-convex problem with both discrete and continuous variables, we propose a joint neural network (NN) design, where a deep reinforcement learning based Pointer Network named advantage pointer-critic (APC) is applied to optimize discrete variables and a deep-unfolding NN is used to optimize the continuous variables. Specifically, we first formulate a Markov decision process to model the user association, and then employ the APC network trained by the advantage actor-critic algorithm to address it. The APC network consists of a Pointer Network and a Multilayer Perceptron. As for the deep-unfolding NN, we first develop a block coordinate descent based algorithm to optimize the UAVs' trajectories and transmission power, and then unfold the algorithm into a layer-wise NN with introduced trainable parameters. These two networks are jointly trained in an unsupervised fashion. Simulation results validate that the proposed joint NN significantly outperforms the optimization algorithm with much lower complexity, and achieves good performances on scalability and generalization ability.
引用
收藏
页码:3310 / 3323
页数:14
相关论文
共 50 条
  • [1] Trajectory Design and Resource Allocation for Multi-UAV Networks: Deep Reinforcement Learning Approaches
    Chang, Zheng
    Deng, Hengwei
    You, Li
    Min, Geyong
    Garg, Sahil
    Kaddoum, Georges
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2023, 10 (05): : 2940 - 2951
  • [2] Joint Task Offloading, Resource Allocation, and Trajectory Design for Multi-UAV Cooperative Edge Computing With Task Priority
    Hao, Hao
    Xu, Changqiao
    Zhang, Wei
    Yang, Shujie
    Muntean, Gabriel-Miro
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (09) : 8649 - 8663
  • [3] Trajectory Planning and Resource Allocation for Multi-UAV Cooperative Computation
    Xu, Wenlong
    Zhang, Tiankui
    Mu, Xidong
    Liu, Yuanwei
    Wang, Yapeng
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2024, 72 (07) : 4305 - 4318
  • [4] Trajectory Design and Resource Allocation for Multi-UAV Communications Under Blockage-Aware Channel Model
    Yi, Pengfei
    Zhu, Lipeng
    Xiao, Zhenyu
    Zhang, Rui
    Han, Zhu
    Xia, Xiang-Gen
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2024, 72 (04) : 2324 - 2338
  • [5] Joint Resource Allocation and Trajectory Design for Resilient Multi-UAV Communication Networks
    Ge, Linghui
    Liang, Xiao
    Zhang, Hua
    Dong, Peihao
    Liao, Jianxin
    Wang, Jingyu
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2024, 13 (04) : 994 - 998
  • [6] Joint Trajectory Design and Resource Allocation in UAV-Enabled Heterogeneous MEC Systems
    Liu, Wenchao
    Wang, Hao
    Zhang, Xuhui
    Xing, Huijun
    Ren, Jinke
    Shen, Yanyan
    Cui, Shuguang
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (19): : 30817 - 30832
  • [7] Joint Trajectory and Resource Allocation Design for Energy-Efficient Secure UAV Communication Systems
    Cai, Yuanxin
    Wei, Zhiqiang
    Li, Ruide
    Ng, Derrick Wing Kwan
    Yuan, Jinhong
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2020, 68 (07) : 4536 - 4553
  • [8] Energy-Efficient Resource Allocation and Trajectory Design for UAV Relaying Systems
    Zhang, Tong
    Liu, Gongliang
    Zhang, Haijun
    Kang, Wenjing
    Karagiannidis, George K.
    Nallanathan, Arumugam
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2020, 68 (10) : 6483 - 6498
  • [9] Multi-UAV Clustered NOMA for Covert Communications: Joint Resource Allocation and Trajectory Optimization
    Qin, Xiaofei
    Wu, Xu
    Xiong, Mudi
    Liu, Ye
    Zhang, Yue
    ELECTRONICS, 2022, 11 (23)
  • [10] Multi-Agent Deep Reinforcement Learning for Trajectory Design and Power Allocation in Multi-UAV Networks
    Zhao, Nan
    Liu, Zehua
    Cheng, Yiqiang
    IEEE ACCESS, 2020, 8 : 139670 - 139679