Cooperative UAV Trajectory Design for Disaster Area Emergency Communications: A Multiagent PPO Method

被引:23
作者
Guan, Yue [1 ]
Zou, Sai [1 ]
Peng, Haixia [2 ]
Ni, Wei [3 ]
Sun, Yanglong [4 ]
Gao, Hongfeng [1 ]
机构
[1] Guizhou Univ, Coll Big Data & Informat Engn, Guiyang 550000, Peoples R China
[2] Xi An Jiao Tong Univ, Coll Comp Sci & Technol, Xian 710049, Peoples R China
[3] CSIRO, Data61 Business Unit, Sydney, NSW 2122, Australia
[4] Jimei Univ, Nav Coll, Xiamen 361021, Peoples R China
基金
中国国家自然科学基金;
关键词
Autonomous aerial vehicles; Radio frequency; Trajectory optimization; Signal to noise ratio; Interference; Vehicle dynamics; Channel models; Free space optical (FSO); K-means; multiagent proximal policy optimization (MAPPO); radio frequency (RF); trajectory optimization; unmanned aerial vehicle (UAV);
D O I
10.1109/JIOT.2023.3320796
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article investigates the issue of cooperative real-time trajectory design for multiple unmanned aerial vehicles (UAVs) to support emergency communication in disaster areas. To restore communication links rapidly between mobile users (MUs) and the ground base stations, UAVs equipped with both radio frequency (RF) modules and free space optics (FSO) modules are utilized as relay nodes. Given the challenges of setting up a central controller for the UAVs and the urgency of emergency communication, the trajectory design problem for these UAVs is formulated as a distributed cooperative optimization problem. Based on the enhanced K-mean algorithm and multiagent PPO (MAPPO) algorithm, a cooperative trajectory design method, abbreviated as KMAPPO, is proposed for the UAVs to minimize interaction overhead and optimize deployment efficiency. Compared to the state-of-the-art deep reinforcement learning (DRL) methods, simulations reveal KMAPPO's superior performance. It converges 32% faster, boosts RF allocation efficiency, and augments FSO communication backhaul capacity.
引用
收藏
页码:8848 / 8859
页数:12
相关论文
共 41 条
[1]   FSO-Based Vertical Backhaul/Fronthaul Framework for 5G+Wireless Networks [J].
Alzenad, Mohamed ;
Shakir, Muhammad Z. ;
Yanikomeroglu, Halim ;
Alouini, Mohamed-Slim .
IEEE COMMUNICATIONS MAGAZINE, 2018, 56 (01) :218-224
[2]  
[Anonymous], 2012, Rep. 36.828
[3]   Localization and Clustering Based on Swarm Intelligence in UAV Networks for Emergency Communications [J].
Arafat, Muhammad Yeasir ;
Moh, Sangman .
IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (05) :8958-8976
[4]   Deep Reinforcement Learning A brief survey [J].
Arulkumaran, Kai ;
Deisenroth, Marc Peter ;
Brundage, Miles ;
Bharath, Anil Anthony .
IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) :26-38
[5]   Performance Analysis of Hybrid FSO Systems Using FSO/RF-FSO Link Adaptation [J].
Bag, Banibrata ;
Das, Akinchan ;
Ansari, Imran Shafique ;
Prokes, Ales ;
Bose, Chayanika ;
Chandra, Aniruddha .
IEEE PHOTONICS JOURNAL, 2018, 10 (03)
[6]   Speed Distribution Curves for Pedestrians during Walking and Crossing [J].
Chandra, Satish ;
Bharti, Anish Kumar .
2ND CONFERENCE OF TRANSPORTATION RESEARCH GROUP OF INDIA (2ND CTRG), 2013, 104 :660-667
[7]   Multi-Agent Reinforcement Learning Based 3D Trajectory Design in Aerial-Terrestrial Wireless Caching Networks [J].
Chen, Yu-Jia ;
Liao, Kai-Min ;
Ku, Meng-Lin ;
Tso, Fung Po ;
Chen, Guan-Yi .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (08) :8201-8215
[8]   Multi-Agent Reinforcement Learning-Based Resource Allocation for UAV Networks [J].
Cui, Jingjing ;
Liu, Yuanwei ;
Nallanathan, Arumugam .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (02) :729-743
[9]   Performance Impact of LoS and NLoS Transmissions in Dense Cellular Networks [J].
Ding, Ming ;
Wang, Peng ;
Lopez-Perez, David ;
Mao, Guoqiang ;
Lin, Zihuai .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2016, 15 (03) :2365-2380
[10]   UAV-Aided Cooperation for FSO Communication Systems [J].
Fawaz, Wissam ;
Abou-Rjeily, Chadi ;
Assi, Chadi .
IEEE COMMUNICATIONS MAGAZINE, 2018, 56 (01) :70-75