Joint Resource Allocation and Trajectory Design for Multi-UAV Systems With Moving Users: Pointer Network and Unfolding

被引:10
|
作者
Hou, Qiushuo [1 ,2 ]
Cai, Yunlong [1 ,2 ]
Hu, Qiyu [1 ,2 ]
Lee, Mengyuan [1 ,2 ]
Yu, Guanding [1 ,2 ]
机构
[1] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China
[2] Zhejiang Prov Key Lab Informat Proc Commun & Netwo, Hangzhou 310027, Peoples R China
基金
中国国家自然科学基金;
关键词
Artificial neural networks; Trajectory; Resource management; Optimization; Communication systems; Autonomous aerial vehicles; Trajectory optimization; Multi-UAV; resource optimization; trajectory design; pointer network; deep reinforcement learning; deep-unfolding; REINFORCEMENT LEARNING APPROACH; NEURAL-NETWORKS; DEEP; MIMO;
D O I
10.1109/TWC.2022.3217176
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
As an important part of the fifth generation (5G) mobile networks, unmanned aerial vehicles (UAVs) have been applied in various communication scenarios due to their high operability and low cost. In this paper, we investigate a multi-UAV communication system with moving users and consider the co-channel interference caused by the transmissions of all other UAVs. To ensure the fairness, we maximize the minimum average user rate during the observed time by jointly optimizing UAVs' trajectories, transmission power, and user association. Considering that UAVs can cover a large area for communications, UAVs do not need to move as soon as the users move. Therefore, a two-timescale structure is proposed for the considered scenario, where the UAVs' trajectories are optimized based on the channel state information (CSI) in a long timescale, while the transmission power and the user association are optimized based on the instantaneous CSI in a short timescale. To effectively tackle this challenging non-convex problem with both discrete and continuous variables, we propose a joint neural network (NN) design, where a deep reinforcement learning based Pointer Network named advantage pointer-critic (APC) is applied to optimize discrete variables and a deep-unfolding NN is used to optimize the continuous variables. Specifically, we first formulate a Markov decision process to model the user association, and then employ the APC network trained by the advantage actor-critic algorithm to address it. The APC network consists of a Pointer Network and a Multilayer Perceptron. As for the deep-unfolding NN, we first develop a block coordinate descent based algorithm to optimize the UAVs' trajectories and transmission power, and then unfold the algorithm into a layer-wise NN with introduced trainable parameters. These two networks are jointly trained in an unsupervised fashion. Simulation results validate that the proposed joint NN significantly outperforms the optimization algorithm with much lower complexity, and achieves good performances on scalability and generalization ability.
引用
收藏
页码:3310 / 3323
页数:14
相关论文
共 50 条
  • [41] Distributed Safe Multi-Agent Reinforcement Learning: Joint Design of THz-Enabled UAV Trajectory and Channel Allocation
    Termehchi, Atefeh
    Syed, Aisha
    Kennedy, William Sean
    Erol-Kantarci, Melike
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (10) : 14172 - 14186
  • [42] Joint Optimization of Trajectory and Resource Allocation in Secure UAV Relaying Communications for Internet of Things
    Na, Zhenyu
    Ji, Chenglan
    Lin, Bin
    Zhang, Ning
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (17) : 16284 - 16296
  • [43] Multi-UAV Replacement and Trajectory Design for Coverage Continuity
    Gupta, Nishant
    Agarwal, Satyam
    Mishra, Deepak
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022,
  • [44] Joint Trajectory and Hybrid Beamforming Design for Multi Antenna UAV Enabled Network
    Zhou, Feng
    Wang, Rugang
    IEEE ACCESS, 2021, 9 : 49131 - 49140
  • [45] Joint Trajectory Design and Resource Allocation for Energy-Efficient UAV enabled eLAA Network
    Xu, Chan
    Chen, Qimei
    Li, Deshi
    ICC 2019 - 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2019,
  • [46] Resource Allocation and Trajectory Design for MISO UAV-Assisted MEC Networks
    Liu, Boyang
    Wan, Yiyao
    Zhou, Fuhui
    Wu, Qihui
    Hu, Rose Qingyang
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (05) : 4933 - 4948
  • [47] Efficient Resource Allocation for Multi-UAV Communication Against Adjacent and Co-Channel Interference
    Zhou, Lingyun
    Chen, Xihan
    Hong, Mingyi
    Jin, Shi
    Shi, Qingjiang
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (10) : 10222 - 10235
  • [48] Joint Optimization of Trajectory and Communication in Multi-UAV Assisted Backscatter Communication Networks
    Du, Yu
    Chen, Zijing
    Hao, Jianjun
    Guo, Yijun
    IEEE ACCESS, 2022, 10 : 40861 - 40871
  • [49] Joint User Association, Resource Allocation and Trajectory Design for Multi-UAV-Aided NOMA Wireless Communication Systems
    Yin, Sixing
    Qu, Zhaowei
    Yu, Peng
    CHINA COMMUNICATIONS, 2025, 22 (03) : 217 - 233
  • [50] Joint Trajectory Design and Power Allocation for UAV Assisted Network With User Mobility
    Wang, Jing
    Zhou, Xiaotian
    Zhang, Haixia
    Yuan, Dongfeng
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (10) : 13173 - 13189