UAV-Based Emergency Communications: An Iterative Two-Stage Multiagent Soft Actor-Critic Approach for Optimal Association and Dynamic Deployment

被引:4
作者
Cao, Yingjie [1 ]
Luo, Yang [1 ]
Yang, Haifen [1 ]
Luo, Chunbo [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 611731, Peoples R China
来源
IEEE INTERNET OF THINGS JOURNAL | 2024年 / 11卷 / 16期
基金
中国国家自然科学基金;
关键词
Heuristic algorithms; Vehicle dynamics; Internet of Things; Wireless communication; Quality of service; Autonomous aerial vehicles; Training; Aerial base station; association policy; emergency communications; multiagent deep reinforcement learning (DRL); soft actor-critic; trajectory optimization; unmanned aerial vehicle (UAV); NETWORKS; BS;
D O I
10.1109/JIOT.2023.3329346
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article investigates future emergency wireless communication systems based on multiple unmanned vehicles cooperative deployment. A terrestrial vehicle with wireless communication and management capabilities are deployed to release multiple unmanned aerial vehicles (UAVs) which will serve as aerial mobile stations (UAV-BSs) to cover a disaster affected area, forming an emergency Internet of Things (IoT) network. Under the proposed system architecture, we formulate a joint optimization challenge considering the UAV-BSs' dynamic deployment positions and the association policy between user equipments (UEs) and BSs to maximize the throughput and coverage in dynamic scenarios as a time-varying mixed-integer nonconvex sequential programming (MINSP) problem. To solve this problem, we first investigate the impact of decision delay caused by physical networking and computing environment on system performance to illustrate the urgent need for efficient algorithms. Then, a two-stage iterative training algorithm called centralized training multiagent soft actor-critic with branch-and-cut (CT-MASAC-BAC) is proposed for computing globally optimal solutions. Numerical results show that CT-MASAC-BAC outperforms the heuristic algorithms and other benchmark deep reinforcement learning algorithms in terms of system utility. Furthermore, the experimental results show that the proposed algorithm is scalable with an increasing number of deployed UAV-BSs, contributing to potentially increased performance with more serving UAV-BSs.
引用
收藏
页码:26610 / 26622
页数:13
相关论文
共 44 条
  • [1] Al-Hourani A, 2014, IEEE GLOB COMM CONF, P2898, DOI 10.1109/GLOCOM.2014.7037248
  • [2] Optimal LAP Altitude for Maximum Coverage
    Al-Hourani, Akram
    Kandeepan, Sithamparanathan
    Lardner, Simon
    [J]. IEEE WIRELESS COMMUNICATIONS LETTERS, 2014, 3 (06) : 569 - 572
  • [3] 3-D Placement of an Unmanned Aerial Vehicle Base Station (UAV-BS) for Energy-Efficient Maximal Coverage
    Alzenad, Mohamed
    El-Keyi, Amr
    Lagum, Faraj
    Yanikomeroglu, Halim
    [J]. IEEE WIRELESS COMMUNICATIONS LETTERS, 2017, 6 (04) : 434 - 437
  • [4] [Anonymous], 1972, REDUCIBILITY COMBINA, DOI [DOI 10.1007/978-1-4684-2001-2_9, 10.1007/978-3-540-68279-0-8, DOI 10.1007/978-1-4684-2001-29]
  • [5] A survey of mobility models for ad hoc network research
    Camp, T
    Boleng, J
    Davies, V
    [J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2002, 2 (05) : 483 - 502
  • [6] An Actor-Critic-Based UAV-BSs Deployment Method for Dynamic Environments
    Chen, Zhiwei
    Zhong, Yi
    Ge, Xiaohu
    Ma, Yi
    [J]. ICC 2020 - 2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2020,
  • [7] Clausen J., 1999, Ph.D. dissertation
  • [8] An Overview of Post-Disaster Emergency Communication Systems in the Future Networks
    Deepak, G. C.
    Ladas, Alexandros
    Sambo, Yusuf Abdulrahman
    Pervaiz, Haris
    Politis, Christos
    Imran, Muhammad Ali
    [J]. IEEE WIRELESS COMMUNICATIONS, 2019, 26 (06) : 132 - 139
  • [9] Foerster JN, 2016, ADV NEUR IN, V29
  • [10] Fujimoto S, 2018, PR MACH LEARN RES, V80