Deep Reinforcement Learning Attitude Control of Fixed-Wing UAVs

Cited by: 0

Authors:

Zhen, Yan [1]

Hao, Mingrui [1]

Sun, Wendi [1]

Affiliations:

[1] Sci & Technol Complex Syst Control & Intelligent, Beijing, Peoples R China

Source:

PROCEEDINGS OF 2020 3RD INTERNATIONAL CONFERENCE ON UNMANNED SYSTEMS (ICUS) | 2020

Keywords:

aircraft; reinforcement learning; controller; attitude; policy;

DOI:

10.1109/icus50048.2020.9274875

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

The fixed-wing UAV is a nonlinear, strongly coupled system. Stable attitude control is the basis for ensuring flight safety and completing tasks successfully, and the nonlinear characteristics of the UAV are the main reason attitude stabilization is difficult. Deep reinforcement learning is a new method for designing UAV attitude controllers: the algorithm learns the nonlinear characteristics of the system from the training data. Owing to its good performance, the PPO algorithm serves as the main reinforcement learning algorithm. The PPO algorithm interacts with a reinforcement learning training environment built on Gazebo to improve the attitude controller. Unlike the traditional PID control method, the attitude controller based on deep reinforcement learning uses a neural network to generate control signals and controls the rotation of the rudder directly.

Pages: 239-244

Page count: 6
