Reinforcement learning based robust tracking control for unmanned helicopter with state constraints and input saturation

被引:0
|
作者
Feng, Yiting [1 ]
Zhou, Ye [1 ]
Ho, Hann Woei [1 ]
机构
[1] Univ Sains Malaysia, Sch Aerosp Engn, Engn Campus, Nibong Tebal 14300, Pulau Pinang, Malaysia
关键词
Reinforcement learning; Helicopter tracking control; Adaptive critic designs; Constrained system; Backstepping control; TIME-OPTIMAL-CONTROL; QUADROTOR; MRAC;
D O I
10.1016/j.ast.2024.109549
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
In this paper, an online adaptive optimal control scheme using reinforcement learning (RL) methodology is developed with applications to helicopters in the presence of input saturation and state constraints. Such a control scheme can overcome the strong nonlinearity and coupling dynamics of helicopters by deploying adaptive critic designs (ACDs). Firstly, the backstepping technique is employed to divide the helicopter system into a kinematic loop and a dynamic loop. In the kinematic loop, a constrained Hamilton-Jacobi-Bellman (HJB) equation containing a barrier function is designed to satisfy state constraints. In the dynamic loop, an input- dependent non-quadratic term is incorporated into the HJB equation to solve the input-constrained optimal control problem. Then, a radial basis function (RBF) neural network (NN) is introduced to establish actor-critic networks for the implementation of adaptive optimal control. The critic network is exploited to optimize the tracking performance, while the approximated optimal control for the nominal error dynamic model is derived from the actor network. Meanwhile, a disturbance observer based on RBF NN is designed to compensate for uncertain system dynamics and external disturbances. Using the concurrent learning technique, a novel online update law of actor-critic networks is designed to relax the persistence of excitation (PE) condition. Moreover, the uniform ultimate boundedness (UUB) of parameter estimation error and the asymptotic convergence of state tracking errors are proven through the Lyapunov-based stability analysis. Finally, simulation results are presented to demonstrate that the proposed control strategy is suitable and effective for the helicopter attitude and altitude tracking control problem.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Flexible-Fixed-Time-Performance-Based Adaptive Tracking Control for Unmanned Helicopter With Input Saturation
    He, Zhiyang
    Shi, Shuang
    Wang, Haibo
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2025, : 3496 - 3508
  • [2] Learning-based robust optimal tracking controller design for unmanned underwater vehicles with full-state and input constraints
    Dong, Botao
    Shi, Yi
    Xie, Wei
    Chen, Weixing
    Zhang, Weidong
    OCEAN ENGINEERING, 2023, 271
  • [3] Reinforcement Learning based Neuro-control Systems for an Unmanned Helicopter
    Lee, Dong Jin
    Bang, Hyochoong
    INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2010), 2010, : 2537 - 2540
  • [4] Attitude reinforcement learning control of an unmanned helicopter with verification
    An H.
    Xian B.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2019, 36 (04): : 516 - 524
  • [5] Switched Tracking Control for Unmanned Helicopter With Position Output Constraints
    Wang, Haibo
    Shi, Shuang
    Wang, Xinhua
    39TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION, YAC 2024, 2024, : 481 - 486
  • [6] Reinforcement learning of composite disturbance observer based tracking control for unmanned aerial helicopter under outside disturbances
    Zhang, Chunyu
    Lu, Changyu
    Li, Tao
    Mao, Zehui
    AEROSPACE SCIENCE AND TECHNOLOGY, 2025, 161
  • [7] Safe reinforcement learning for affine nonlinear systems with state constraints and input saturation using control barrier functions
    Liu, Shihan
    Liu, Lijun
    Yu, Zhen
    NEUROCOMPUTING, 2023, 518 : 562 - 576
  • [8] Reinforcement learning based time-varying formation control for quadrotor unmanned aerial vehicles system with input saturation
    Chi Ma
    Yizhe Cao
    Dianbiao Dong
    Applied Intelligence, 2023, 53 : 28730 - 28744
  • [9] Reinforcement learning based time-varying formation control for quadrotor unmanned aerial vehicles system with input saturation
    Ma, Chi
    Cao, Yizhe
    Dong, Dianbiao
    APPLIED INTELLIGENCE, 2023, 53 (23) : 28730 - 28744
  • [10] Reinforcement Learning Control for a 2-DOF Helicopter With State Constraints: Theory and Experiments
    Zhao, Zhijia
    He, Weitian
    Mu, Chaoxu
    Zou, Tao
    Hong, Keum-Shik
    Li, Han-Xiong
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (01) : 157 - 167