A novel model-free robust saturated reinforcement learning-based controller for quadrotors guaranteeing prescribed transient and steady state performance

被引：31

作者：

Elhaki, Omid ^{[1
]}

Shojaei, Khoshnam ^{[1
,2
]}

机构：

[1] Islamic Azad Univ, Dept Elect Engn, Najafabad Branch, Daneshgah Blvd, Najafabad 8514143131, Iran

[2] Islamic Azad Univ, Digital Proc & Machine Vis Res Ctr, Najafabad Branch, Najafabad, Iran

来源：

AEROSPACE SCIENCE AND TECHNOLOGY | 2021年 / 119卷

关键词：

Reinforcement learning; Saturation function; Prescribed performance; Actuator saturation; Quadrotor; ADAPTIVE-CONTROL; TRACKING CONTROL; NONLINEAR-SYSTEMS; FEEDBACK-CONTROL; NEURAL-CONTROL; UAV; DESIGN; AFFINE; QUAVS;

D O I：

10.1016/j.ast.2021.107128

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

For the purpose of improving the performance of trajectory tracking for quadrotors with the control input saturation, a novel model-free saturated prescribed performance reinforcement learning framework is proposed in the presence of the model uncertainties, nonlinearities and external disturbances. In this paper, saturation functions are employed to deal with input saturation, and the actuator's saturation nonlinearity is compensated by an intelligent method to decrease the saturation effects. Moreover, the prescribed performance control is utilized to ensure an adjustable transient and steady state response for the tracking errors. Besides, adaptive robust controllers are introduced to handle the effects of external disturbances online. A novel controller is proposed in collaboration with a reinforcement learning method based on actor-critic neural networks. The actor neural network is employed to estimate nonlinearities, actuator saturation nonlinearity, and model uncertainties, and the critic neural network is applied to estimate the reinforcement signals, which regulates the control action of the actor neural network online. The proposed actor-critic-based control structure benefits from a model-free calculation and only depends on the measurable signals of the closed-loop system. This freedom from system dynamics leads to a significant low computational load for the controller and, therefore, the proposed control method is computationally cost-effective. The adaptive robust controllers and the proposed actor-critic structures are trained online, and the convergence behavior of their learning laws is investigated in the course of stability examination. For the proof of stability, Lyapunov's direct method is used to show that all error variables of the closed-loop nonlinear control system are uniformly ultimately bounded. Finally, simulations along with some quantitative comparisons verify the efficiency and usefulness of the proposed control scheme. (C) 2021 Elsevier Masson SAS. All rights reserved.

引用

页数：29

共 79 条

[1] Three-loop uncertainties compensator and sliding mode quadrotor control [J].

Alqaisi, Walid ;

Ghommam, Jawhar ;

Alazzam, Anas ;

Saad, Maarouf ;

Nerguizian, Vahe .

COMPUTERS & ELECTRICAL ENGINEERING, 2020, 81

[2] Model predictive control of three-axis gimbal system mounted on UAV for real-time target tracking under external disturbances [J].

Altan, Aytac ;

Hacioglu, Rifat .

MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2020, 138

[3]

Altan Aytac, 2018, 2018 6 INT C CONTROL

[4]

Altayeva Aigerim, 2017, 2017 17th International Conference on Control, Automation and Systems (ICCAS). Proceedings, P1, DOI 10.23919/ICCAS.2017.8204281

[5]

[Anonymous], 2013, Optimal adaptive control and differential games by reinforcement learning principles

[6]

[Anonymous], 2017, 5 INT C ADV MECH ROB

[7] Quaternion-based nonlinear attitude control of quadrotor formations carrying a slung load [J].

Ariyibi, Segun O. ;

Tekinalp, Ozan .

AEROSPACE SCIENCE AND TECHNOLOGY, 2020, 105

[8] Robust optimal motion planning approach to cooperative grasping and transporting using multiple UAVs based on SDRE [J].

Babaie, Raziyeh ;

Ehyaie, Amir Farhad .

TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2017, 39 (09) :1391-1408

[9] Robust Adaptive Control of Feedback Linearizable MIMO Nonlinear Systems With Prescribed Performance [J].

Bechlioulis, Charalampos P. ;

Rovithakis, George A. .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2008, 53 (09) :2090-2099

[10] Prescribed Performance Adaptive Control for Multi-Input Multi-Output Affine in the Control Nonlinear Systems [J].

Bechlioulis, Charalampos P. ;

Rovithakis, George A. .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2010, 55 (05) :1220-1226

← 1 2 3 4 5 6 7 8 →