Soft Actor-Critic Deep Reinforcement Learning for Fault-Tolerant Flight Control

被引：11

作者：

Dally, K. ^{[1
]}

van Kampen, E. ^{[1
]}

机构：

[1] Delft Univ Technol, Fac Aerosp Engn, Control & Simulat Div, POB 5058, NL-2600 GB Delft, Netherlands

来源：

AIAA SCITECH 2022 FORUM | 2022年

关键词：

D O I：

10.2514/6.2022-2078

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

Fault-tolerant flight control faces challenges, as developing a model-based controller for each unexpected failure is unrealistic, and online learning methods can handle limited system complexity due to their low sample efficiency. In this research, a model-free coupled-dynamics flight controller for a jet aircraft able to withstand multiple failure types is proposed. An offline-trained cascaded Soft Actor-Critic Deep Reinforcement Learning controller is successful on highly coupled maneuvers, including a coordinated 40 degrees-bank climbing turn with a normalized Mean Absolute Error of 2.64%. The controller is robust to six failure cases, including the rudder jammed at -15 degrees, the aileron effectiveness reduced by 70%, a structural failure, icing and a backward c.g. shift as the response is stable and the climbing turn is completed successfully. Robustness to biased sensor noise, atmospheric disturbances, and to varying initial flight conditions and reference signal shapes is also demonstrated.

引用

页数：20

共 34 条

[1]

Achiam J., 2018, Benchmarks

[2]

Ba J. L., 2016, arXiv, DOI 10.48550/arXiv:1607.06450

[3] DYNAMIC PROGRAMMING [J].

BELLMAN, R .

SCIENCE, 1966, 153 (3731) :34-&

[4]

Bohn E, 2019, INT CONF UNMAN AIRCR, P523, DOI [10.1109/ICUAS.2019.8798254, 10.1109/icuas.2019.8798254]

[5]

Delahaye D, 2014, AIR TRAFFIC MANAGEME, P205, DOI DOI 10.1007/978-4-431-54475-3_12

[6] Helicopter trimming and tracking control using direct neural dynamic programming [J].

Enns, R ;

Si, J .

IEEE TRANSACTIONS ON NEURAL NETWORKS, 2003, 14 (04) :929-939

[7] Online adaptive critic flight control [J].

Ferrari, S ;

Stengel, RF .

JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2004, 27 (05) :777-786

[8]

Fujimoto S, 2018, PR MACH LEARN RES, V80

[9]

Glorot X., 2010, P AISTATS SARD IT, P249

[10]

Grondman F., 2018 AIAA GUID NAV C, P1, DOI [10.2514/6.2018-0385, DOI 10.2514/6.2018-0385]

← 1 2 3 4 →