Evolutionary Reinforcement Learning: Hybrid Approach for Safety-Informed Fault-Tolerant Flight Control

被引：0

作者：

Gavra, Vlad ^{[1
]}

van Kampen, Erik-Jan ^{[1
]}

机构：

[1] Delft Univ Technol, Fac Aerosp Engn, Control & Simulat Sect, POB 5058, NL-2600 GB Delft, Netherlands

来源：

JOURNAL OF GUIDANCE CONTROL AND DYNAMICS | 2024年 / 47卷 / 05期

关键词：

Reinforcement Learning; Fixed Wing Aircraft; Artificial Intelligence; Fault-tolerant Flight Control; Deep Reinforcement Learning; Evolutionary Algorithms; Control Policy Noise; POSE ESTIMATION; RELATIVE NAVIGATION; UNCOOPERATIVE SPACECRAFT; CIRCLE;

D O I：

10.2514/1.G008112

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

Recent research in artificial intelligence potentially provides solutions to the challenging problem of fault-tolerant and robust flight control. This paper proposes a novel Safety-Informed Evolutionary Reinforcement Learning algorithm (SERL), which combines Deep Reinforcement Learning (DRL) and neuroevolution to optimize a population of nonlinear control policies. Using SERL, the work has trained agents to provide attitude tracking on a high-fidelity nonlinear fixed-wing aircraft model. Compared to a state-of-the-art DRL solution, SERL achieves better tracking performance in nine out of ten cases, remaining robust against faults and changes in flight conditions, while providing smoother action signals.

引用

页码：887 / 900

页数：14

共 51 条

[1]

Allard Maxime, 2023, ACM Transactions on Evolutionary Learning, V3, P1, DOI DOI 10.1145/3596912

[2]

[Anonymous], 2020, Safety Report

[3]

[Anonymous], 2021, Certification Specifications and Acceptable Means of Compliance for Large Aeroplanes (CS-25)

[4]

Bertsekas D., 2019, REINFORCEMENT LEARNI

[5]

Bodnar C., 2020, Proceedings of the AAAI Conference on Artificial Intelligence, V34, P3283, DOI [10.1609/aaai.v34i04.5728, DOI 10.1609/AAAI.V34I04.5728]

[6]

Bohn E, 2019, INT CONF UNMAN AIRCR, P523, DOI [10.1109/icuas.2019.8798254, 10.1109/ICUAS.2019.8798254]

[7] Worst-Case Analysis of Complex Nonlinear Flight Control Designs Using Deep Q-Learning [J].

Braun, David ;

Marb, Michael M. ;

Angelov, Jorg ;

Wechner, Maximilian ;

Holzapfel, Florian .

JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2023, 46 (07) :1365-1377

[8]

Byoung-Tzk Zhang, 1993, Complex Systems, V7, P199

[9] Soft Actor-Critic with Inhibitory Networks for Retraining UAV Controllers Faster [J].

Choi, Minkyu ;

Filter, Max ;

Alcedo, Kevin ;

Walker, Thayne T. ;

Rosenbluth, David ;

Ide, Jaime S. .

2022 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS (ICUAS), 2022, :1561-1570

[10] Robots that can adapt like animals [J].

Cully, Antoine ;

Clune, Jeff ;

Tarapore, Danesh ;

Mouret, Jean-Baptiste .

NATURE, 2015, 521 (7553) :503-U476

← 1 2 3 4 5 6 →