Learning-Based Attitude Tracking Control With High-Performance Parameter Estimation

被引:9
作者
Dong, Hongyang [1 ]
Zhao, Xiaowei [1 ]
Hu, Qinglei [2 ]
Yang, Haoyang [2 ]
Qi, Pengyuan [3 ]
机构
[1] Univ Warwick, Intelligent Control & Smart Energy Res Grp, Sch Engn, Coventry CV4 7AL, W Midlands, England
[2] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
[3] Beihang Univ, Res Inst Frontier Sci, Beijing 100191, Peoples R China
基金
英国工程与自然科学研究理事会;
关键词
Attitude control; Uncertainty; Task analysis; Optimal control; Cost function; Tracking; Mathematical models; Adaptive control; adaptive dynamic programming (ADP); attitude tracking control; parameter estimation; APPROXIMATE OPTIMAL-CONTROL; ADAPTIVE-CONTROL; STABILIZATION;
D O I
10.1109/TAES.2021.3130537
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
This article aims to handle the optimal attitude tracking control tasks for rigid bodies via a reinforcement-learning-based control scheme, in which a constrained parameter estimator is designed to compensate system uncertainties accurately. This estimator guarantees the exponential convergence of estimation errors and can strictly keep all instant estimates always within predetermined bounds. Based on it, a critic-only adaptive dynamic programming (ADP) control strategy is proposed to learn the optimal control policy with respect to a user-defined cost function. The matching condition on reference control signals, which is commonly employed in relevant ADP design, is not required in the proposed control scheme. We prove the uniform ultimate boundedness of the tracking errors and critic weight's estimation errors under finite excitation conditions by Lyapunov-based analysis. Moreover, an easy-to-implement initial control policy is designed to trigger the real-time learning process. The effectiveness and advantages of the proposed method are verified by both numerical simulations and hardware-in-the-loop experimental tests.
引用
收藏
页码:2218 / 2230
页数:13
相关论文
共 32 条
  • [1] Immersion and invariance: A new tool for stabilization and adaptive control of nonlinear systems
    Astolfi, A
    Ortega, R
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2003, 48 (04) : 590 - 606
  • [2] A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems
    Bhasin, S.
    Kamalapurkar, R.
    Johnson, M.
    Vamvoudakis, K. G.
    Lewis, F. L.
    Dixon, W. E.
    [J]. AUTOMATICA, 2013, 49 (01) : 82 - 92
  • [3] Exponential parameter and tracking error convergence guarantees for adaptive controllers without persistency of excitation
    Chowdhary, Girish
    Muehlegg, Maximilian
    Johnson, Eric
    [J]. INTERNATIONAL JOURNAL OF CONTROL, 2014, 87 (08) : 1583 - 1603
  • [4] Theory and Flight-Test Validation of a Concurrent-Learning Adaptive Controller
    Chowdhary, Girish V.
    Johnson, Eric N.
    [J]. JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2011, 34 (02) : 592 - 607
  • [5] Reinforcement Learning-Based Approximate Optimal Control for Attitude Reorientation Under State Constraints
    Dong, Hongyang
    Zhao, Xiaowei
    Yang, Haoyang
    [J]. IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2021, 29 (04) : 1664 - 1673
  • [6] Continuous Finite-Time Attitude Control for Rigid Spacecraft Based on Angular Velocity Observer
    Hu, Qinglei
    Jiang, Boyan
    [J]. IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2018, 54 (03) : 1082 - 1092
  • [7] Passivity-Based Attitude Synchronization in SE(3)
    Igarashi, Yuji
    Hatanaka, Takeshi
    Fujita, Masayuki
    Spong, Mark W.
    [J]. IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2009, 17 (05) : 1119 - 1134
  • [8] Ioannou P. A., 2012, ROBUST ADAPTIVE CONT
  • [9] Jiang, 2017, ROBUST ADAPTIVE DYNA
  • [10] Concurrent Learning for Parameter Estimation Using Dynamic State-Derivative Estimators
    Kamalapurkar, Rushikesh
    Reish, Benjamin
    Chowdhary, Girish
    Dixon, Warren E.
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2017, 62 (07) : 3594 - 3601