Research on reinforcement learning-based safe decision-making methodology for multiple unmanned aerial vehicles

被引:3
|
作者
Yue, Longfei [1 ]
Yang, Rennong [1 ]
Zhang, Ying [1 ]
Zuo, Jialiang [1 ]
机构
[1] Air Force Engn Univ, Air Traff Control & Nav Coll, Xian, Peoples R China
基金
中国国家自然科学基金;
关键词
multi-UAV; constrained Markov decision process; SAC-Lagrangian; transfer learning; reinforcement learning;
D O I
10.3389/fnbot.2022.1105480
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A system with multiple cooperating unmanned aerial vehicles (multi-UAVs) can use its advantages to accomplish complicated tasks. Recent developments in deep reinforcement learning (DRL) offer good prospects for decision-making for multi-UAV systems. However, the safety and training efficiencies of DRL still need to be improved before practical use. This study presents a transfer-safe soft actor-critic (TSSAC) for multi-UAV decision-making. Decision-making by each UAV is modeled with a constrained Markov decision process (CMDP), in which safety is constrained to maximize the return. The soft actor-critic-Lagrangian (SAC-Lagrangian) algorithm is combined with a modified Lagrangian multiplier in the CMDP model. Moreover, parameter-based transfer learning is used to enable cooperative and efficient training of the tasks to the multi-UAVs. Simulation experiments indicate that the proposed method can improve the safety and training efficiencies and allow the UAVs to adapt to a dynamic scenario.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Research on Intelligent Merging Decision-making of Unmanned Vehicles Based on Reinforcement Learning
    Chen, Xue-mei
    Zhang, Qiang
    Zhang, Zhen-hua
    Liu, Ge-meng
    Gong, Jian-wei
    Chan, Ching-Yao
    2018 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2018, : 91 - 96
  • [2] Computational Architecture for Autonomous Decision-Making in Unmanned Aerial Vehicles
    Sankararaman, Shankar
    Goebel, Kai
    MICRO- AND NANOTECHNOLOGY SENSORS, SYSTEMS, AND APPLICATIONS X, 2018, 10639
  • [3] Cooperatively pursuing a target unmanned aerial vehicle by multiple unmanned aerial vehicles based on multiagent reinforcement learning
    Wang X.
    Xuan S.
    Ke L.
    Advanced Control for Applications: Engineering and Industrial Systems, 2020, 2 (02):
  • [4] A Comprehensive Driving Decision-Making Methodology Based on Deep Reinforcement Learning for Automated Commercial Vehicles
    Hu, Weiming
    Li, Xu
    Hu, Jinchao
    Liu, Yan
    Zhou, Jinying
    INTERNATIONAL JOURNAL OF AUTOMOTIVE TECHNOLOGY, 2024, 25 (06) : 1469 - 1483
  • [5] Model-Reference Reinforcement Learning for Safe Aerial Recovery of Unmanned Aerial Vehicles
    Zhao, Bocheng
    Huo, Mingying
    Yu, Ze
    Qi, Naiming
    Wang, Jianfeng
    AEROSPACE, 2024, 11 (01)
  • [6] A Reinforcement Learning-Based Fire Warning and Suppression System Using Unmanned Aerial Vehicles
    Panahi, Fereidoun H.
    Panahi, Farzad H.
    Ohtsuki, Tomoaki
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [7] Reinforcement Learning-Based Decision-Making for Vehicular Edge Computing
    Maleki, Homa
    Basaran, Mehmet
    Durak-Ata, Lutfiye
    29TH IEEE CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS (SIU 2021), 2021,
  • [8] Reinforcement Learning-Based Intelligent Decision-Making for Communication Parameters
    Xie, Xia
    Dou, Zheng
    Zhang, Yabin
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2022, 16 (09): : 2942 - 2960
  • [9] Iterative learning-based formation control for multiple quadrotor unmanned aerial vehicles
    Zhao, Zhihui
    Wang, Jing
    Chen, Yangquan
    Ju, Shuang
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2020, 17 (02)
  • [10] Learning-based Wildfire Tracking with Unmanned Aerial Vehicles
    Jia, Qiong
    Xin, Ming
    Hu, Xiaolin
    Chao, Haiyang
    2022 AMERICAN CONTROL CONFERENCE, ACC, 2022, : 3212 - 3217