Decentralized Trajectory and Power Control Based on Multi-Agent Deep Reinforcement Learning in UAV Networks

被引:9
|
作者
Chen, Binqiang [1 ]
Liu, Dong [1 ]
Hanzo, Lajos [2 ]
机构
[1] Beihang Univ, Beijing, Peoples R China
[2] Unveristy Southampton, Southampton, Hants, England
来源
IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022) | 2022年
基金
中国国家自然科学基金;
关键词
UAV; multi-agent deep reinforcement learning; MADDPG; power allocation; trajectory planning; UNMANNED AERIAL VEHICLES;
D O I
10.1109/ICC45855.2022.9838637
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Unmanned aerial vehicles (UAVs) are capable of enhancing the coverage of existing cellular networks by acting as aerial base stations (ABSs). Due to the limited on-board battery capacity and dynamic topology of UAV networks, trajectory planning and interference coordination are crucial for providing satisfactory service, especially in emergency scenarios, where it is unrealistic to control all UAVs in a centralized manner by gathering global user information. Hence, we solve the decentralized joint trajectory and transmit power control problem of multi-UAV ABS networks. Our goal is to maximize the number of satisfied users, while minimizing the overall energy consumption of UAVs. To allow each UAV to adjust its position and transmit power solely based on local-rather the global-observations, a multi-agent reinforcement learning (MARL) framework is conceived. In order to overcome the non-stationarity issue of MARL and to endow the UAVs with distributed decision making capability, we resort to the centralized training in conjunction with decentralized execution paradigm. By judiciously designing the reward, we propose a decentralized joint trajectory and power control (DTPC) algorithm with significantly reduced complexity. Our simulation results show that the proposed DTPC algorithm outperforms the state-of-the-art deep reinforcement learning based methods, despite its low complexity.
引用
收藏
页码:3983 / 3988
页数:6
相关论文
共 50 条
  • [1] Multi-Agent Deep Reinforcement Learning for Trajectory Design and Power Allocation in Multi-UAV Networks
    Zhao, Nan
    Liu, Zehua
    Cheng, Yiqiang
    IEEE ACCESS, 2020, 8 : 139670 - 139679
  • [2] Multi-Agent Deep Reinforcement Learning-Based Trajectory Planning for Multi-UAV Assisted Mobile Edge Computing
    Wang, Liang
    Wang, Kezhi
    Pan, Cunhua
    Xu, Wei
    Aslam, Nauman
    Hanzo, Lajos
    IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2021, 7 (01) : 73 - 84
  • [3] Multi-UAV Trajectory Design and Power Control Based on Deep Reinforcement Learning
    Zhang C.Y.
    Liang S.Y.
    He C.L.
    Wang K.Z.
    Journal of Communications and Information Networks, 2022, 7 (02): : 192 - 201
  • [4] Multi-Agent Reinforcement Learning Trajectory Design and Two-Stage Resource Management in CoMP UAV VLC Networks
    Maleki, Mohammad Reza
    Mili, Mohammad Robat
    Javan, Mohammad Reza
    Mokari, Nader
    Jorswieck, Eduard A. A.
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2022, 70 (11) : 7464 - 7476
  • [5] Joint Optimization of Handover Control and Power Allocation Based on Multi-Agent Deep Reinforcement Learning
    Guo, Delin
    Tang, Lan
    Zhang, Xinggan
    Liang, Ying-Chang
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (11) : 13124 - 13138
  • [6] Multi-Agent Deep Reinforcement Learning for Decentralized Voltage-Var Control in Distribution Power System
    Zhang, Mengfan
    Xu, Qianwen
    Magnusson, Sindri
    Pilawa-Podgurski, Robert C. N.
    Guo, Guodong
    2022 IEEE ENERGY CONVERSION CONGRESS AND EXPOSITION (ECCE), 2022,
  • [7] UAV-Enabled Secure Communications by Multi-Agent Deep Reinforcement Learning
    Zhang, Yu
    Mou, Zhiyu
    Gao, Feifei
    Jiang, Jing
    Ding, Ruijin
    Han, Zhu
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (10) : 11599 - 11611
  • [8] Multi-agent Deep Reinforcement Learning-based Trajectory Design for UAV-aided Edge Computing System
    Lu, Gengyuan
    Chang, Zheng
    2023 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC, 2023,
  • [9] Deep Reinforcement Learning for Trajectory Design and Power Allocation in UAV Networks
    Zhao, Nan
    Cheng, Yiqiang
    Pei, Yiyang
    Liang, Ying-Chang
    Niyato, Dusit
    ICC 2020 - 2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2020,
  • [10] Distributed Drive Autonomous Vehicle Trajectory Tracking Control Based on Multi-Agent Deep Reinforcement Learning
    Liu, Yalei
    Ding, Weiping
    Yang, Mingliang
    Zhu, Honglin
    Liu, Liyuan
    Jin, Tianshi
    MATHEMATICS, 2024, 12 (11)