Reinforcement learning based robust tracking control for unmanned helicopter with state constraints and input saturation

Cited by: 0
Authors
Feng, Yiting [1]
Zhou, Ye [1]
Ho, Hann Woei [1]
Affiliations
[1] Univ Sains Malaysia, Sch Aerosp Engn, Engn Campus, Nibong Tebal 14300, Pulau Pinang, Malaysia
Keywords
Reinforcement learning; Helicopter tracking control; Adaptive critic designs; Constrained system; Backstepping control; TIME-OPTIMAL-CONTROL; QUADROTOR; MRAC;
DOI
10.1016/j.ast.2024.109549
Chinese Library Classification (CLC) number
V [Aviation, Aerospace];
Discipline classification code
08; 0825;
Abstract
In this paper, an online adaptive optimal control scheme based on reinforcement learning (RL) is developed for helicopters subject to input saturation and state constraints. The scheme handles the strong nonlinearity and coupled dynamics of helicopters by deploying adaptive critic designs (ACDs). First, the backstepping technique is employed to divide the helicopter system into a kinematic loop and a dynamic loop. In the kinematic loop, a constrained Hamilton-Jacobi-Bellman (HJB) equation containing a barrier function is designed to satisfy the state constraints. In the dynamic loop, an input-dependent non-quadratic term is incorporated into the HJB equation to solve the input-constrained optimal control problem. Then, radial basis function (RBF) neural networks (NNs) are used to build the actor-critic networks that implement the adaptive optimal control. The critic network optimizes the tracking performance, while the actor network provides the approximate optimal control for the nominal error dynamic model. Meanwhile, a disturbance observer based on an RBF NN is designed to compensate for uncertain system dynamics and external disturbances. Using the concurrent learning technique, a novel online update law for the actor-critic networks is designed to relax the persistence of excitation (PE) condition. Moreover, the uniform ultimate boundedness (UUB) of the parameter estimation error and the asymptotic convergence of the state tracking errors are proven through Lyapunov-based stability analysis. Finally, simulation results are presented to demonstrate that the proposed control strategy is suitable and effective for the helicopter attitude and altitude tracking control problem.
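As a rough illustration of three ingredients named in the abstract (Gaussian RBF features, a bounded control input, and a barrier term for a state constraint), the following minimal Python sketch may help. It is not the authors' implementation: the abstract does not give the exact barrier function, non-quadratic cost, or network structure, so the log-barrier form, the tanh saturation, and the names `rbf_features`, `saturated_control`, `log_barrier`, `width`, `u_max`, and `bound` are all assumptions chosen for illustration.

```python
import math


def rbf_features(x, centers, width):
    """Gaussian RBF activations phi_i(x) = exp(-||x - c_i||^2 / (2 * width^2)).

    The centers and the shared width are hypothetical design parameters.
    """
    feats = []
    for c in centers:
        sq_dist = sum((xi - ci) ** 2 for xi, ci in zip(x, c))
        feats.append(math.exp(-sq_dist / (2.0 * width ** 2)))
    return feats


def saturated_control(weights, feats, u_max):
    """Linear-in-features actor output squashed by tanh (assumed form).

    The tanh keeps the command within [-u_max, u_max] for any weights.
    """
    raw = sum(w * f for w, f in zip(weights, feats))
    return u_max * math.tanh(raw / u_max)


def log_barrier(e, bound):
    """Assumed log-type barrier: 0 at e = 0, unbounded as |e| approaches bound."""
    if abs(e) >= bound:
        raise ValueError("tracking error is outside the constraint set")
    return math.log(bound ** 2 / (bound ** 2 - e ** 2))
```

For example, `saturated_control([100.0], [1.0], 2.0)` stays within the limit of 2.0 even though the raw actor output is 100, and `log_barrier` grows as the error moves from 0.5 toward a bound of 1.0, which is the qualitative behavior a barrier-augmented cost relies on.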
Pages: 12
Related papers
50 items in total
  • [41] Adaptive Finite-Time 6-DOF Tracking Control for Spacecraft Fly Around With Input Saturation and State Constraints
    Huang, Yi
    Jia, Yingmin
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2019, 55 (06) : 3259 - 3272
  • [42] Robust tracking control with reinforcement learning for nonlinear-constrained systems
    Tang, Yuhong
    Yang, Xiong
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2022, 32 (18) : 9902 - 9919
  • [43] Adaptive Reinforcement Learning Neural Network Control for Uncertain Nonlinear System With Input Saturation
    Bai, Weiwei
    Zhou, Qi
    Li, Tieshan
    Li, Hongyi
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (08) : 3433 - 3443
  • [44] Application of reinforcement learning to RC helicopter control
    Murao, H
    Tamaki, H
    Kitamura, S
    SICE 2003 ANNUAL CONFERENCE, VOLS 1-3, 2003, : 2306 - 2309
  • [45] Reinforcement learning-based optimal trajectory tracking control of surface vessels under input saturations
    Wei, Ziping
    Du, Jialu
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2023, 33 (06) : 3807 - 3825
  • [46] Aggressive and robust low-level control and trajectory tracking for quadrotors with deep reinforcement learning
    Shiyu Chen
    Yanjie Li
    Yunjiang Lou
    Ke Lin
    Neural Computing and Applications, 2025, 37 (3) : 1223 - 1240
  • [47] Reinforcement learning-based trajectory tracking optimal control of unmanned surface vehicles in narrow water areas
    Wei, Ziping
    Du, Jialu
    ISA TRANSACTIONS, 2025, 159 : 152 - 164
  • [48] Robust trajectory tracking control for unmanned surface vessels under motion constraints and environmental disturbances
    Zhang, Ruo
    Liu, Yuanchang
    Anderlini, Enrico
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART M-JOURNAL OF ENGINEERING FOR THE MARITIME ENVIRONMENT, 2022, 236 (02) : 394 - 411
  • [49] Monte Carlo-based reinforcement learning control for unmanned aerial vehicle systems
    Wei, Qinglai
    Yang, Zesheng
    Su, Huaizhong
    Wang, Lijian
    NEUROCOMPUTING, 2022, 507 : 282 - 291
  • [50] Robust constrained autopilot control for a generic missile in the presence of state and input constraints
    Feng, Zhenxin
    Guo, Jianguo
    Zhou, Jun
    OPTIK, 2017, 149 : 49 - 58