Learning-Based Attitude Tracking Control With High-Performance Parameter Estimation

被引：9

作者：

Dong, Hongyang ^{[1
]}

Zhao, Xiaowei ^{[1
]}

Hu, Qinglei ^{[2
]}

Yang, Haoyang ^{[2
]}

Qi, Pengyuan ^{[3
]}

机构：

[1] Univ Warwick, Intelligent Control & Smart Energy Res Grp, Sch Engn, Coventry CV4 7AL, W Midlands, England

[2] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China

[3] Beihang Univ, Res Inst Frontier Sci, Beijing 100191, Peoples R China

来源：

IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS | 2022年 / 58卷 / 03期

基金：

英国工程与自然科学研究理事会;

关键词：

Attitude control; Uncertainty; Task analysis; Optimal control; Cost function; Tracking; Mathematical models; Adaptive control; adaptive dynamic programming (ADP); attitude tracking control; parameter estimation; APPROXIMATE OPTIMAL-CONTROL; ADAPTIVE-CONTROL; STABILIZATION;

D O I：

10.1109/TAES.2021.3130537

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

This article aims to handle the optimal attitude tracking control tasks for rigid bodies via a reinforcement-learning-based control scheme, in which a constrained parameter estimator is designed to compensate system uncertainties accurately. This estimator guarantees the exponential convergence of estimation errors and can strictly keep all instant estimates always within predetermined bounds. Based on it, a critic-only adaptive dynamic programming (ADP) control strategy is proposed to learn the optimal control policy with respect to a user-defined cost function. The matching condition on reference control signals, which is commonly employed in relevant ADP design, is not required in the proposed control scheme. We prove the uniform ultimate boundedness of the tracking errors and critic weight's estimation errors under finite excitation conditions by Lyapunov-based analysis. Moreover, an easy-to-implement initial control policy is designed to trigger the real-time learning process. The effectiveness and advantages of the proposed method are verified by both numerical simulations and hardware-in-the-loop experimental tests.

引用

页码：2218 / 2230

页数：13

共 32 条

[1] Immersion and invariance: A new tool for stabilization and adaptive control of nonlinear systems
Astolfi, A
Ortega, R
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2003, 48 (04) : 590 - 606
[2] A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems
Bhasin, S.
Kamalapurkar, R.
Johnson, M.
Vamvoudakis, K. G.
Lewis, F. L.
Dixon, W. E.
[J]. AUTOMATICA, 2013, 49 (01) : 82 - 92
[3] Exponential parameter and tracking error convergence guarantees for adaptive controllers without persistency of excitation
Chowdhary, Girish
Muehlegg, Maximilian
Johnson, Eric
[J]. INTERNATIONAL JOURNAL OF CONTROL, 2014, 87 (08) : 1583 - 1603
[4] Theory and Flight-Test Validation of a Concurrent-Learning Adaptive Controller
Chowdhary, Girish V.
Johnson, Eric N.
[J]. JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2011, 34 (02) : 592 - 607
[5] Reinforcement Learning-Based Approximate Optimal Control for Attitude Reorientation Under State Constraints
Dong, Hongyang
Zhao, Xiaowei
Yang, Haoyang
[J]. IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2021, 29 (04) : 1664 - 1673
[6] Continuous Finite-Time Attitude Control for Rigid Spacecraft Based on Angular Velocity Observer
Hu, Qinglei
Jiang, Boyan
[J]. IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2018, 54 (03) : 1082 - 1092
[7] Passivity-Based Attitude Synchronization in SE(3)
Igarashi, Yuji
Hatanaka, Takeshi
Fujita, Masayuki
Spong, Mark W.
[J]. IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2009, 17 (05) : 1119 - 1134
[8] Ioannou P. A., 2012, ROBUST ADAPTIVE CONT
[9] Jiang, 2017, ROBUST ADAPTIVE DYNA
[10] Concurrent Learning for Parameter Estimation Using Dynamic State-Derivative Estimators
Kamalapurkar, Rushikesh
Reish, Benjamin
Chowdhary, Girish
Dixon, Warren E.
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2017, 62 (07) : 3594 - 3601

← 1 2 3 4 →