Soft Actor-Critic Deep Reinforcement Learning for Fault-Tolerant Flight Control

被引:11
作者
Dally, K. [1 ]
van Kampen, E. [1 ]
机构
[1] Delft Univ Technol, Fac Aerosp Engn, Control & Simulat Div, POB 5058, NL-2600 GB Delft, Netherlands
来源
AIAA SCITECH 2022 FORUM | 2022年
关键词
D O I
10.2514/6.2022-2078
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Fault-tolerant flight control faces challenges, as developing a model-based controller for each unexpected failure is unrealistic, and online learning methods can handle limited system complexity due to their low sample efficiency. In this research, a model-free coupled-dynamics flight controller for a jet aircraft able to withstand multiple failure types is proposed. An offline-trained cascaded Soft Actor-Critic Deep Reinforcement Learning controller is successful on highly coupled maneuvers, including a coordinated 40 degrees-bank climbing turn with a normalized Mean Absolute Error of 2.64%. The controller is robust to six failure cases, including the rudder jammed at -15 degrees, the aileron effectiveness reduced by 70%, a structural failure, icing and a backward c.g. shift as the response is stable and the climbing turn is completed successfully. Robustness to biased sensor noise, atmospheric disturbances, and to varying initial flight conditions and reference signal shapes is also demonstrated.
引用
收藏
页数:20
相关论文
共 34 条
[1]  
Achiam J., 2018, Benchmarks
[2]  
Ba J. L., 2016, arXiv, DOI 10.48550/arXiv:1607.06450
[3]   DYNAMIC PROGRAMMING [J].
BELLMAN, R .
SCIENCE, 1966, 153 (3731) :34-&
[4]  
Bohn E, 2019, INT CONF UNMAN AIRCR, P523, DOI [10.1109/ICUAS.2019.8798254, 10.1109/icuas.2019.8798254]
[5]  
Delahaye D, 2014, AIR TRAFFIC MANAGEME, P205, DOI DOI 10.1007/978-4-431-54475-3_12
[6]   Helicopter trimming and tracking control using direct neural dynamic programming [J].
Enns, R ;
Si, J .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2003, 14 (04) :929-939
[7]   Online adaptive critic flight control [J].
Ferrari, S ;
Stengel, RF .
JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2004, 27 (05) :777-786
[8]  
Fujimoto S, 2018, PR MACH LEARN RES, V80
[9]  
Glorot X., 2010, P AISTATS SARD IT, P249
[10]  
Grondman F., 2018 AIAA GUID NAV C, P1, DOI [10.2514/6.2018-0385, DOI 10.2514/6.2018-0385]