Evolutionary Reinforcement Learning: Hybrid Approach for Safety-Informed Fault-Tolerant Flight Control

被引:0
作者
Gavra, Vlad [1 ]
van Kampen, Erik-Jan [1 ]
机构
[1] Delft Univ Technol, Fac Aerosp Engn, Control & Simulat Sect, POB 5058, NL-2600 GB Delft, Netherlands
关键词
Reinforcement Learning; Fixed Wing Aircraft; Artificial Intelligence; Fault-tolerant Flight Control; Deep Reinforcement Learning; Evolutionary Algorithms; Control Policy Noise; POSE ESTIMATION; RELATIVE NAVIGATION; UNCOOPERATIVE SPACECRAFT; CIRCLE;
D O I
10.2514/1.G008112
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Recent research in artificial intelligence potentially provides solutions to the challenging problem of fault-tolerant and robust flight control. This paper proposes a novel Safety-Informed Evolutionary Reinforcement Learning algorithm (SERL), which combines Deep Reinforcement Learning (DRL) and neuroevolution to optimize a population of nonlinear control policies. Using SERL, the work has trained agents to provide attitude tracking on a high-fidelity nonlinear fixed-wing aircraft model. Compared to a state-of-the-art DRL solution, SERL achieves better tracking performance in nine out of ten cases, remaining robust against faults and changes in flight conditions, while providing smoother action signals.
引用
收藏
页码:887 / 900
页数:14
相关论文
共 51 条
[1]  
Allard Maxime, 2023, ACM Transactions on Evolutionary Learning, V3, P1, DOI DOI 10.1145/3596912
[2]  
[Anonymous], 2020, Safety Report
[3]  
[Anonymous], 2021, Certification Specifications and Acceptable Means of Compliance for Large Aeroplanes (CS-25)
[4]  
Bertsekas D., 2019, REINFORCEMENT LEARNI
[5]  
Bodnar C., 2020, Proceedings of the AAAI Conference on Artificial Intelligence, V34, P3283, DOI [10.1609/aaai.v34i04.5728, DOI 10.1609/AAAI.V34I04.5728]
[6]  
Bohn E, 2019, INT CONF UNMAN AIRCR, P523, DOI [10.1109/icuas.2019.8798254, 10.1109/ICUAS.2019.8798254]
[7]   Worst-Case Analysis of Complex Nonlinear Flight Control Designs Using Deep Q-Learning [J].
Braun, David ;
Marb, Michael M. ;
Angelov, Jorg ;
Wechner, Maximilian ;
Holzapfel, Florian .
JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2023, 46 (07) :1365-1377
[8]  
Byoung-Tzk Zhang, 1993, Complex Systems, V7, P199
[9]   Soft Actor-Critic with Inhibitory Networks for Retraining UAV Controllers Faster [J].
Choi, Minkyu ;
Filter, Max ;
Alcedo, Kevin ;
Walker, Thayne T. ;
Rosenbluth, David ;
Ide, Jaime S. .
2022 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS (ICUAS), 2022, :1561-1570
[10]   Robots that can adapt like animals [J].
Cully, Antoine ;
Clune, Jeff ;
Tarapore, Danesh ;
Mouret, Jean-Baptiste .
NATURE, 2015, 521 (7553) :503-U476