Joint Resource Allocation and Trajectory Design for Multi-UAV Systems With Moving Users: Pointer Network and Unfolding

被引:10
|
作者
Hou, Qiushuo [1 ,2 ]
Cai, Yunlong [1 ,2 ]
Hu, Qiyu [1 ,2 ]
Lee, Mengyuan [1 ,2 ]
Yu, Guanding [1 ,2 ]
机构
[1] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China
[2] Zhejiang Prov Key Lab Informat Proc Commun & Netwo, Hangzhou 310027, Peoples R China
基金
中国国家自然科学基金;
关键词
Artificial neural networks; Trajectory; Resource management; Optimization; Communication systems; Autonomous aerial vehicles; Trajectory optimization; Multi-UAV; resource optimization; trajectory design; pointer network; deep reinforcement learning; deep-unfolding; REINFORCEMENT LEARNING APPROACH; NEURAL-NETWORKS; DEEP; MIMO;
D O I
10.1109/TWC.2022.3217176
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
As an important part of the fifth generation (5G) mobile networks, unmanned aerial vehicles (UAVs) have been applied in various communication scenarios due to their high operability and low cost. In this paper, we investigate a multi-UAV communication system with moving users and consider the co-channel interference caused by the transmissions of all other UAVs. To ensure the fairness, we maximize the minimum average user rate during the observed time by jointly optimizing UAVs' trajectories, transmission power, and user association. Considering that UAVs can cover a large area for communications, UAVs do not need to move as soon as the users move. Therefore, a two-timescale structure is proposed for the considered scenario, where the UAVs' trajectories are optimized based on the channel state information (CSI) in a long timescale, while the transmission power and the user association are optimized based on the instantaneous CSI in a short timescale. To effectively tackle this challenging non-convex problem with both discrete and continuous variables, we propose a joint neural network (NN) design, where a deep reinforcement learning based Pointer Network named advantage pointer-critic (APC) is applied to optimize discrete variables and a deep-unfolding NN is used to optimize the continuous variables. Specifically, we first formulate a Markov decision process to model the user association, and then employ the APC network trained by the advantage actor-critic algorithm to address it. The APC network consists of a Pointer Network and a Multilayer Perceptron. As for the deep-unfolding NN, we first develop a block coordinate descent based algorithm to optimize the UAVs' trajectories and transmission power, and then unfold the algorithm into a layer-wise NN with introduced trainable parameters. These two networks are jointly trained in an unsupervised fashion. Simulation results validate that the proposed joint NN significantly outperforms the optimization algorithm with much lower complexity, and achieves good performances on scalability and generalization ability.
引用
收藏
页码:3310 / 3323
页数:14
相关论文
共 50 条
  • [31] Joint Optimization of Resource Allocation and Multi-UAV Trajectory in Space-Air-Ground IoRT Networks
    Liu, Man
    Wang, Ying
    Li, Zhendong
    Lyu, Xinpeng
    Chen, Yuanbin
    2020 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE WORKSHOPS (WCNCW), 2020,
  • [32] Energy-Efficiency Joint Trajectory and Resource Allocation Optimization in Cognitive UAV Systems
    Liang, Xiaopeng
    Deng, Qian
    Shu, Feng
    Wang, Jiangzhou
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (22) : 23058 - 23071
  • [33] Joint Optimization of Trajectory Control, Resource Allocation, and User Association Based on DRL for Multi-Fixed-Wing UAV Networks
    Yin, Baolin
    Fang, Xuming
    Wang, Xianbin
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (10) : 13330 - 13343
  • [34] Joint Resource Allocation and UAV Trajectory Design for Data Collection in Air-Ground Integrated IoRT Sensors Network With Clustered NOMA
    Li, Shichao
    Yu, Zhiqiang
    Chen, Lian
    IEEE SENSORS JOURNAL, 2024, 24 (22) : 38540 - 38550
  • [35] Joint User Scheduling, Power Allocation, and Trajectory Design for Joint Synthetic Aperture Radar and Communication UAV Systems
    Liu, Ziyi
    Fei, Zesong
    Wang, Xinyi
    Liu, Peng
    Xu, Shanfeng
    Zhou, Jianming
    Yuan, Weijie
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2025, 74 (02) : 3006 - 3016
  • [36] Joint Resource Allocation and 3-D Deployment for Multi-UAV Covert Communications
    Mao, Haobin
    Liu, Yanming
    Xiao, Zhenyu
    Han, Zhu
    Xia, Xiang-Gen
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (01): : 559 - 572
  • [37] Joint Subcarrier Allocation, Modulation Mode Selection, and Trajectory Design in a UAV-Based OFDMA Network
    Li, Shichao
    Zhang, Ning
    Chen, Hongbin
    Lin, Siyu
    Wu, Huici
    IEEE COMMUNICATIONS LETTERS, 2022, 26 (09) : 2111 - 2115
  • [38] Reconfigurable Intelligent Surface-Assisted Multi-UAV Networks: Efficient Resource Allocation With Deep Reinforcement Learning
    Khoi Khac Nguyen
    Khosravirad, Saeed R.
    da Costa, Daniel Benevides
    Nguyen, Long D.
    Duong, Trung Q.
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2022, 16 (03) : 358 - 368
  • [39] Three-Dimension Trajectory Design for Multi-UAV Wireless Network With Deep Reinforcement Learning
    Zhang, Wenqi
    Wang, Qiang
    Liu, Xiao
    Liu, Yuanwei
    Chen, Yue
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (01) : 600 - 612
  • [40] Collaborative Design of Multi-UAV Trajectory and Resource Scheduling for 6G-Enabled Internet of Things
    Wang, Jun
    Na, Zhenyu
    Liu, Xin
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (20) : 15096 - 15106