Reinforcement learning based robust tracking control for unmanned helicopter with state constraints and input saturation

Authors
Feng, Yiting [1]
Zhou, Ye [1]
Ho, Hann Woei [1]
Affiliations
[1] Univ Sains Malaysia, Sch Aerosp Engn, Engn Campus, Nibong Tebal 14300, Pulau Pinang, Malaysia
Keywords
Reinforcement learning; Helicopter tracking control; Adaptive critic designs; Constrained system; Backstepping control; TIME-OPTIMAL-CONTROL; QUADROTOR; MRAC
DOI
10.1016/j.ast.2024.109549
Chinese Library Classification (CLC) number
V [Aviation, Aerospace]
Discipline classification code
08; 0825
Abstract
In this paper, an online adaptive optimal control scheme using reinforcement learning (RL) methodology is developed with applications to helicopters in the presence of input saturation and state constraints. Such a control scheme can overcome the strong nonlinearity and coupling dynamics of helicopters by deploying adaptive critic designs (ACDs). Firstly, the backstepping technique is employed to divide the helicopter system into a kinematic loop and a dynamic loop. In the kinematic loop, a constrained Hamilton-Jacobi-Bellman (HJB) equation containing a barrier function is designed to satisfy state constraints. In the dynamic loop, an input-dependent non-quadratic term is incorporated into the HJB equation to solve the input-constrained optimal control problem. Then, a radial basis function (RBF) neural network (NN) is introduced to establish actor-critic networks for the implementation of adaptive optimal control. The critic network is exploited to optimize the tracking performance, while the approximated optimal control for the nominal error dynamic model is derived from the actor network. Meanwhile, a disturbance observer based on RBF NN is designed to compensate for uncertain system dynamics and external disturbances. Using the concurrent learning technique, a novel online update law of actor-critic networks is designed to relax the persistence of excitation (PE) condition. Moreover, the uniform ultimate boundedness (UUB) of parameter estimation error and the asymptotic convergence of state tracking errors are proven through the Lyapunov-based stability analysis. Finally, simulation results are presented to demonstrate that the proposed control strategy is suitable and effective for the helicopter attitude and altitude tracking control problem.
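The abstract mentions an input-dependent non-quadratic term for handling input saturation but does not give its form. As a minimal sketch only, the Python below uses the inverse-hyperbolic-tangent penalty that is common in the constrained optimal control literature, for a scalar input; whether the paper uses exactly this form is an assumption. The names `nonquadratic_cost`, `saturated_control`, `lam` (saturation bound), `r` (input weight), and `grad_term` (standing in for the value-gradient term that multiplies the input in the HJB equation) are all hypothetical.

```python
import math


def nonquadratic_cost(u: float, lam: float, r: float) -> float:
    """Assumed penalty U(u) = 2*r * integral_0^u lam*atanh(v/lam) dv.

    Closed form of the integral; defined only for |u| < lam.
    """
    if abs(u) >= lam:
        raise ValueError("input must lie strictly inside the saturation bound")
    x = u / lam
    # integral of atanh(v/lam) dv = v*atanh(v/lam) + (lam/2)*ln(1 - v^2/lam^2)
    return 2.0 * r * lam * u * math.atanh(x) + r * lam**2 * math.log1p(-x * x)


def saturated_control(grad_term: float, lam: float, r: float) -> float:
    """Stationary point of U(u) + grad_term*u with respect to u.

    Setting dU/du + grad_term = 0 gives u = -lam*tanh(grad_term / (2*lam*r)),
    which never exceeds lam in magnitude.
    """
    return -lam * math.tanh(grad_term / (2.0 * lam * r))


if __name__ == "__main__":
    # Illustrative values only, not taken from the paper.
    for g in (0.1, 1.5, 50.0):
        u = saturated_control(g, lam=2.0, r=1.0)
        print(g, u, nonquadratic_cost(u, lam=2.0, r=1.0) if abs(u) < 2.0 else None)
```

For small inputs this penalty behaves like the usual quadratic cost `r*u**2`, while the resulting control law stays within the bound by construction, which is the usual motivation for a penalty of this type.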
Pages: 12
Related papers
50 in total
  • [31] Waypoint Tracking Control for a Quadrotor based on PID and Reinforcement Learning
    Bao, Xurui
    Jing, Zhouhui
    CONTROL ENGINEERING AND APPLIED INFORMATICS, 2023, 25 (01) : 90 - 100
  • [32] Time optimal control of triple integrator with input saturation and full state constraints
    He, Suqin
    Hu, Chuxiong
    Zhu, Yu
    Tomizuka, Masayoshi
    AUTOMATICA, 2020, 122 (122)
  • [33] Adaptive fault tolerant control for hypersonic vehicle with input saturation and state constraints
    Sun, Jing-Guang
    Li, Chuan-Ming
    Guo, Yong
    Wang, Chang-Qing
    Li, Peng
    ACTA ASTRONAUTICA, 2020, 167 : 302 - 313
  • [34] Reinforcement learning-based tracking control for a quadrotor unmanned aerial vehicle under external disturbances
    Liu, Hui
    Li, Bo
    Xiao, Bing
    Ran, Dechao
    Zhang, Chengxi
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2023, 33 (17) : 10360 - 10377
  • [35] Robust tracking control for uncertain MIMO nonlinear systems with input saturation using RWNNDO
    Chen, Mou
    Zhou, Yanlong
    Guo, William W.
    NEUROCOMPUTING, 2014, 144 : 436 - 447
  • [36] Continuous Control for Moving Object Tracking of Unmanned Skid-Steered Vehicle Based on Reinforcement Learning
    Li, Zheng
    Zhou, Junjie
    Li, Xueyuan
    Du, Xu
    Wang, Lei
    Wang, Yun
    PROCEEDINGS OF 2020 3RD INTERNATIONAL CONFERENCE ON UNMANNED SYSTEMS (ICUS), 2020, : 456 - 461
  • [37] Distributed optimal formation tracking control based on reinforcement learning for underactuated AUVs with asymmetric constraints
    Wang, Zhengkun
    Zhang, Lijun
    OCEAN ENGINEERING, 2023, 280
  • [38] Tracking and Aiming Adaptive Control for Unmanned Combat Ground Vehicle on the Move Based on Reinforcement Learning Compensation
    Wei L.
    Gong J.
    Chen H.
    Li Z.
    Gong C.
    Binggong Xuebao/Acta Armamentarii, 2022, 43 (08) : 1947 - 1955
  • [39] Fixed-Time Convergence Adaptive Robust Controller for Overhead Cranes Based on Reinforcement Learning Considering Input Saturation
    Wei, Junren
    Xu, Weimin
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2025,
  • [40] Optimal tracking control based on reinforcement learning value iteration algorithm for time-delayed nonlinear systems with external disturbances and input constraints
    Mohammadi, Mehdi
    Arefi, Mohammad Mehdi
    Setoodeh, Peyman
    Kaynak, Okyay
    INFORMATION SCIENCES, 2021, 554 : 84 - 98