Research on reinforcement learning-based safe decision-making methodology for multiple unmanned aerial vehicles

被引：3

作者：

Yue, Longfei ^{[1
]}

Yang, Rennong ^{[1
]}

Zhang, Ying ^{[1
]}

Zuo, Jialiang ^{[1
]}

机构：

[1] Air Force Engn Univ, Air Traff Control & Nav Coll, Xian, Peoples R China

来源：

FRONTIERS IN NEUROROBOTICS | 2023年 / 16卷

基金：

中国国家自然科学基金;

关键词：

multi-UAV; constrained Markov decision process; SAC-Lagrangian; transfer learning; reinforcement learning;

D O I：

10.3389/fnbot.2022.1105480

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A system with multiple cooperating unmanned aerial vehicles (multi-UAVs) can use its advantages to accomplish complicated tasks. Recent developments in deep reinforcement learning (DRL) offer good prospects for decision-making for multi-UAV systems. However, the safety and training efficiencies of DRL still need to be improved before practical use. This study presents a transfer-safe soft actor-critic (TSSAC) for multi-UAV decision-making. Decision-making by each UAV is modeled with a constrained Markov decision process (CMDP), in which safety is constrained to maximize the return. The soft actor-critic-Lagrangian (SAC-Lagrangian) algorithm is combined with a modified Lagrangian multiplier in the CMDP model. Moreover, parameter-based transfer learning is used to enable cooperative and efficient training of the tasks to the multi-UAVs. Simulation experiments indicate that the proposed method can improve the safety and training efficiencies and allow the UAVs to adapt to a dynamic scenario.

引用

页数：16

共 50 条

[1] Research on Intelligent Merging Decision-making of Unmanned Vehicles Based on Reinforcement Learning
Chen, Xue-mei
Zhang, Qiang
Zhang, Zhen-hua
Liu, Ge-meng
Gong, Jian-wei
Chan, Ching-Yao
2018 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2018, : 91 - 96
[2] Computational Architecture for Autonomous Decision-Making in Unmanned Aerial Vehicles
Sankararaman, Shankar
Goebel, Kai
MICRO- AND NANOTECHNOLOGY SENSORS, SYSTEMS, AND APPLICATIONS X, 2018, 10639
[3] Cooperatively pursuing a target unmanned aerial vehicle by multiple unmanned aerial vehicles based on multiagent reinforcement learning
Wang X.
Xuan S.
Ke L.
Advanced Control for Applications: Engineering and Industrial Systems, 2020, 2 (02):
[4] A Comprehensive Driving Decision-Making Methodology Based on Deep Reinforcement Learning for Automated Commercial Vehicles
Hu, Weiming
Li, Xu
Hu, Jinchao
Liu, Yan
Zhou, Jinying
INTERNATIONAL JOURNAL OF AUTOMOTIVE TECHNOLOGY, 2024, 25 (06) : 1469 - 1483
[5] Model-Reference Reinforcement Learning for Safe Aerial Recovery of Unmanned Aerial Vehicles
Zhao, Bocheng
Huo, Mingying
Yu, Ze
Qi, Naiming
Wang, Jianfeng
AEROSPACE, 2024, 11 (01)
[6] A Reinforcement Learning-Based Fire Warning and Suppression System Using Unmanned Aerial Vehicles
Panahi, Fereidoun H.
Panahi, Farzad H.
Ohtsuki, Tomoaki
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
[7] Reinforcement Learning-Based Decision-Making for Vehicular Edge Computing
Maleki, Homa
Basaran, Mehmet
Durak-Ata, Lutfiye
29TH IEEE CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS (SIU 2021), 2021,
[8] Reinforcement Learning-Based Intelligent Decision-Making for Communication Parameters
Xie, Xia
Dou, Zheng
Zhang, Yabin
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2022, 16 (09): : 2942 - 2960
[9] Iterative learning-based formation control for multiple quadrotor unmanned aerial vehicles
Zhao, Zhihui
Wang, Jing
Chen, Yangquan
Ju, Shuang
INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2020, 17 (02)
[10] Learning-based Wildfire Tracking with Unmanned Aerial Vehicles
Jia, Qiong
Xin, Ming
Hu, Xiaolin
Chao, Haiyang
2022 AMERICAN CONTROL CONFERENCE, ACC, 2022, : 3212 - 3217

← 1 2 3 4 5 →