UAV-Based Emergency Communications: An Iterative Two-Stage Multiagent Soft Actor-Critic Approach for Optimal Association and Dynamic Deployment

被引：4

作者：

Cao, Yingjie ^{[1
]}

Luo, Yang ^{[1
]}

Yang, Haifen ^{[1
]}

Luo, Chunbo ^{[1
]}

机构：

[1] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 611731, Peoples R China

来源：

IEEE INTERNET OF THINGS JOURNAL | 2024年 / 11卷 / 16期

基金：

中国国家自然科学基金;

关键词：

Heuristic algorithms; Vehicle dynamics; Internet of Things; Wireless communication; Quality of service; Autonomous aerial vehicles; Training; Aerial base station; association policy; emergency communications; multiagent deep reinforcement learning (DRL); soft actor-critic; trajectory optimization; unmanned aerial vehicle (UAV); NETWORKS; BS;

D O I：

10.1109/JIOT.2023.3329346

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This article investigates future emergency wireless communication systems based on multiple unmanned vehicles cooperative deployment. A terrestrial vehicle with wireless communication and management capabilities are deployed to release multiple unmanned aerial vehicles (UAVs) which will serve as aerial mobile stations (UAV-BSs) to cover a disaster affected area, forming an emergency Internet of Things (IoT) network. Under the proposed system architecture, we formulate a joint optimization challenge considering the UAV-BSs' dynamic deployment positions and the association policy between user equipments (UEs) and BSs to maximize the throughput and coverage in dynamic scenarios as a time-varying mixed-integer nonconvex sequential programming (MINSP) problem. To solve this problem, we first investigate the impact of decision delay caused by physical networking and computing environment on system performance to illustrate the urgent need for efficient algorithms. Then, a two-stage iterative training algorithm called centralized training multiagent soft actor-critic with branch-and-cut (CT-MASAC-BAC) is proposed for computing globally optimal solutions. Numerical results show that CT-MASAC-BAC outperforms the heuristic algorithms and other benchmark deep reinforcement learning algorithms in terms of system utility. Furthermore, the experimental results show that the proposed algorithm is scalable with an increasing number of deployed UAV-BSs, contributing to potentially increased performance with more serving UAV-BSs.

引用

页码：26610 / 26622

页数：13

共 44 条

[1] Al-Hourani A, 2014, IEEE GLOB COMM CONF, P2898, DOI 10.1109/GLOCOM.2014.7037248
[2] Optimal LAP Altitude for Maximum Coverage
Al-Hourani, Akram
Kandeepan, Sithamparanathan
Lardner, Simon
[J]. IEEE WIRELESS COMMUNICATIONS LETTERS, 2014, 3 (06) : 569 - 572
[3] 3-D Placement of an Unmanned Aerial Vehicle Base Station (UAV-BS) for Energy-Efficient Maximal Coverage
Alzenad, Mohamed
El-Keyi, Amr
Lagum, Faraj
Yanikomeroglu, Halim
[J]. IEEE WIRELESS COMMUNICATIONS LETTERS, 2017, 6 (04) : 434 - 437
[4] [Anonymous], 1972, REDUCIBILITY COMBINA, DOI [DOI 10.1007/978-1-4684-2001-2_9, 10.1007/978-3-540-68279-0-8, DOI 10.1007/978-1-4684-2001-29]
[5] A survey of mobility models for ad hoc network research
Camp, T
Boleng, J
Davies, V
[J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2002, 2 (05) : 483 - 502
[6] An Actor-Critic-Based UAV-BSs Deployment Method for Dynamic Environments
Chen, Zhiwei
Zhong, Yi
Ge, Xiaohu
Ma, Yi
[J]. ICC 2020 - 2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2020,
[7] Clausen J., 1999, Ph.D. dissertation
[8] An Overview of Post-Disaster Emergency Communication Systems in the Future Networks
Deepak, G. C.
Ladas, Alexandros
Sambo, Yusuf Abdulrahman
Pervaiz, Haris
Politis, Christos
Imran, Muhammad Ali
[J]. IEEE WIRELESS COMMUNICATIONS, 2019, 26 (06) : 132 - 139
[9] Foerster JN, 2016, ADV NEUR IN, V29
[10] Fujimoto S, 2018, PR MACH LEARN RES, V80

← 1 2 3 4 5 →