UAV-Based Emergency Communications: An Iterative Two-Stage Multiagent Soft Actor-Critic Approach for Optimal Association and Dynamic Deployment

Times cited: 4

Authors:

Cao, Yingjie [1]

Luo, Yang [1]

Yang, Haifen [1]

Luo, Chunbo [1]

Affiliations:

[1] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 611731, Peoples R China

Source:

IEEE INTERNET OF THINGS JOURNAL | 2024, Vol. 11, No. 16

Funding:

National Natural Science Foundation of China;

Keywords:

Heuristic algorithms; Vehicle dynamics; Internet of Things; Wireless communication; Quality of service; Autonomous aerial vehicles; Training; Aerial base station; association policy; emergency communications; multiagent deep reinforcement learning (DRL); soft actor-critic; trajectory optimization; unmanned aerial vehicle (UAV); NETWORKS; BS;

DOI:

10.1109/JIOT.2023.3329346

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

This article investigates future emergency wireless communication systems based on the cooperative deployment of multiple unmanned vehicles. A terrestrial vehicle with wireless communication and management capabilities is deployed to release multiple unmanned aerial vehicles (UAVs), which serve as aerial mobile stations (UAV-BSs) to cover a disaster-affected area, forming an emergency Internet of Things (IoT) network. Under the proposed system architecture, we formulate the joint optimization of the UAV-BSs' dynamic deployment positions and the association policy between user equipments (UEs) and BSs, with the aim of maximizing throughput and coverage in dynamic scenarios, as a time-varying mixed-integer nonconvex sequential programming (MINSP) problem. To solve this problem, we first investigate how the decision delay caused by the physical networking and computing environment affects system performance, which illustrates the urgent need for efficient algorithms. Then, a two-stage iterative training algorithm called centralized training multiagent soft actor-critic with branch-and-cut (CT-MASAC-BAC) is proposed for computing globally optimal solutions. Numerical results show that CT-MASAC-BAC outperforms heuristic algorithms and other benchmark deep reinforcement learning algorithms in terms of system utility. Furthermore, the experimental results show that the proposed algorithm scales with an increasing number of deployed UAV-BSs, so that adding serving UAV-BSs can potentially increase performance.
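The two-stage iteration described in the abstract (solve the UE-to-BS association for fixed UAV-BS positions, then update the positions) can be illustrated with a toy sketch. This is not the authors' CT-MASAC-BAC algorithm: exhaustive enumeration stands in for branch-and-cut, a hand-written "move toward the centroid" rule stands in for the learned multiagent soft actor-critic policy, and the channel model, the constants `ALTITUDE` and `SNR_REF`, the per-BS `capacity` limit, and all function names are assumptions made only for illustration.

```python
import itertools
import math

ALTITUDE = 100.0  # assumed UAV height in metres (illustrative only)
SNR_REF = 1e6     # assumed reference SNR at 1 m distance (illustrative only)


def rate(ue, bs):
    """Shannon-style rate (bit/s/Hz) under an assumed free-space path loss."""
    d2 = (ue[0] - bs[0]) ** 2 + (ue[1] - bs[1]) ** 2 + ALTITUDE ** 2
    return math.log2(1.0 + SNR_REF / d2)


def associate(ues, bss, capacity):
    """Stage 1: best UE-to-BS assignment with at most `capacity` UEs per BS.

    Exhaustive enumeration stands in for branch-and-cut; toy sizes only.
    """
    best_total, best_assign = -1.0, None
    for assign in itertools.product(range(len(bss)), repeat=len(ues)):
        if any(assign.count(b) > capacity for b in range(len(bss))):
            continue  # violates the per-BS capacity limit
        total = sum(rate(ues[u], bss[b]) for u, b in enumerate(assign))
        if total > best_total:
            best_total, best_assign = total, assign
    if best_assign is None:
        raise ValueError("no feasible association for this capacity")
    return best_assign, best_total


def reposition(ues, bss, assign, step=0.5):
    """Stage 2: move each BS part-way toward the centroid of its own UEs.

    A hand-written rule standing in for the learned multiagent policy.
    """
    moved = []
    for b, (bx, by) in enumerate(bss):
        mine = [ues[u] for u, a in enumerate(assign) if a == b]
        if not mine:
            moved.append((bx, by))  # nothing to serve: stay in place
            continue
        cx = sum(x for x, _ in mine) / len(mine)
        cy = sum(y for _, y in mine) / len(mine)
        moved.append((bx + step * (cx - bx), by + step * (cy - by)))
    return moved


def alternate(ues, bss, capacity, rounds=10):
    """Iterate both stages, keeping a move only if total rate does not drop."""
    assign, total = associate(ues, bss, capacity)
    for _ in range(rounds):
        candidate = reposition(ues, bss, assign)
        new_assign, new_total = associate(ues, candidate, capacity)
        if new_total < total:
            break
        bss, assign, total = candidate, new_assign, new_total
    return bss, assign, total


# Toy scenario: two clusters of two UEs each and two UAV-BSs (made-up data).
ues = [(0.0, 0.0), (10.0, 0.0), (200.0, 0.0), (210.0, 0.0)]
start_bss = [(50.0, 50.0), (150.0, 50.0)]
final_bss, final_assign, final_total = alternate(ues, start_bss, capacity=2)
```

Because a position update is accepted only when the re-solved association does not lower the total rate, the sketch never ends worse than it started; it makes no claim about global optimality or about the decision-delay effects studied in the article.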

Citation

Pages: 26610-26622

Page count: 13

