Unmanned Aerial Vehicle Pitch Control Using Deep Reinforcement Learning with Discrete Actions in Wind Tunnel Test

被引：15

作者：

Wada, Daichi ^{[1
]}

Araujo-Estrada, Sergio A. ^{[2
]}

Windsor, Shane ^{[2
]}

机构：

[1] Japan Aerosp Explorat Agcy, Aeronaut Technol Directorate, Tokyo 1810015, Japan

[2] Univ Bristol, Dept Aerosp Engn, Bristol BS8 1TR, Avon, England

来源：

AEROSPACE | 2021年 / 8卷 / 01期

基金：

欧洲研究理事会;

关键词：

attitude control; deep reinforcement learning; fixed-wing aircraft; unmanned aerial vehicle; wind tunnel test;

D O I：

10.3390/aerospace8010018

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

Deep reinforcement learning is a promising method for training a nonlinear attitude controller for fixed-wing unmanned aerial vehicles. Until now, proof-of-concept studies have demonstrated successful attitude control in simulation. However, detailed experimental investigations have not yet been conducted. This study applied deep reinforcement learning for one-degree-of-freedom pitch control in wind tunnel tests with the aim of gaining practical understandings of attitude control application. Three controllers with different discrete action choices, that is, elevator angles, were designed. The controllers with larger action rates exhibited better performance in terms of following angle-of-attack commands. The root mean square errors for tracking angle-of-attack commands decreased from 3.42 degrees to 1.99 degrees as the maximum action rate increased from 10 degrees/s to 50 degrees/s. The comparison between experimental and simulation results showed that the controller with a smaller action rate experienced the friction effect, and the controllers with larger action rates experienced fluctuating behaviors in elevator maneuvers owing to delay. The investigation of the effect of friction and delay on pitch control highlighted the importance of conducting experiments to understand actual control performances, specifically when the controllers were trained with a low-fidelity model.

引用

页码：1 / 16

页数：16

共 27 条

[1]

Bohn E, 2019, INT CONF UNMAN AIRCR, P523, DOI [10.1109/icuas.2019.8798254, 10.1109/ICUAS.2019.8798254]

[2] Soft biohybrid morphing wings with feathers underactuated by wrist and finger motion [J].

Chang, Eric ;

Matloff, Laura Y. ;

Stowers, Amanda K. ;

Lentink, David .

SCIENCE ROBOTICS, 2020, 5 (38)

[3]

Dadian O, 2016, P AMER CONTR CONF, P1341, DOI 10.1109/ACC.2016.7525104

[4] Bioinspired morphing wings for extended flight envelope and roll control of small drones [J].

Di Luca, M. ;

Mintchev, S. ;

Heitz, G. ;

Noca, F. ;

Floreano, D. .

INTERFACE FOCUS, 2017, 7 (01)

[5] Classical/neural synthesis of nonlinear control systems [J].

Ferrari, S ;

Stengel, RF .

JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2002, 25 (03) :442-448

[6]

Gu S., 2016, ARXIV 2016 1603 0074

[7]

Gu WB, 2019, INT CONF UNMAN AIRCR, P362, DOI [10.1109/icuas.2019.8797853, 10.1109/ICUAS.2019.8797853]

[8]

Hwang I., 2020, P AIAA FOR ORLANDO

[9] Deep Neural Network Compression for Aircraft Collision Avoidance Systems [J].

Julian, Kyle D. ;

Kochenderfer, Mykel J. ;

Owen, Michael P. .

JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2019, 42 (03) :598-608

[10]

Kim B.S., 1993, Proceedings. The First IEEE Regional Conference on Aerospace Control Systems, P176

← 1 2 3 →