Reinforcement learning-based finite time control for the asymmetric underactuated tethered spacecraft with disturbances

被引：3

作者：

Lu, Yingbo ^{[1
]}

Wang, Xingyu ^{[1
]}

Liu, Ya ^{[2
]}

Huang, Panfeng ^{[3
]}

机构：

[1] Zhengzhou Univ Light Ind, Sch Elect Informat Engn, Zhengzhou 450001, Peoples R China

[2] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China

[3] Northwestern Polytech Univ, Res Ctr Intelligent Robot, Sch Astronaut, Xian 710072, Peoples R China

来源：

ACTA ASTRONAUTICA | 2024年 / 220卷

基金：

中国国家自然科学基金;

关键词：

Asymmetric underactuated tethered spacecraft; Reinforcement learning; Finite time control; Actor-critic; STABILITY; SYSTEMS; DESIGN;

D O I：

10.1016/j.actaastro.2024.04.014

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

This article addresses an attitude stabilization control problem for the asymmetric underactuated tethered spacecraft subject to external disturbances, and a reinforcement learning(RL)-based finite time control scheme is proposed to enhance the control performance and energy efficiency of the closed-loop system. Firstly, the error dynamics of the underactuated tethered system in the presence of external disturbances is built based on the Lagrange's modeling technique. Then, a RL-based control algorithm is implemented by a radial basis function (RBF) neural network (NN), in which the actor-critic networks are developed to obtain the optimal performance index function and the optimal controller. According to the Lyapunov theorem, semi-global finite- time stability of all the closed-loop signals is achieved through rigorous mathematical analysis, and tracking errors can be ensured to an arbitrarily small neighborhood of the origin in a finite time. Finally, comparative simulation results with hierarchical sliding mode controller are presented to demonstrate the viability of the proposed strategy.

引用

页码：218 / 229

页数：12

共 34 条

[1] Reinforcement Learning-Based Fixed-Time Trajectory Tracking Control for Uncertain Robotic Manipulators With Input Saturation [J].

Cao, Shengjie ;

Sun, Liang ;

Jiang, Jingjing ;

Zuo, Zongyu .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (08) :4584-4595

[2] Adaptive Optimal Tracking Control of an Underactuated Surface Vessel Using Actor-Critic Reinforcement Learning [J].

Chen, Lin ;

Dai, Shi-Lu ;

Dong, Chao .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (06) :7520-7533

[3] Adaptive sliding mode control for deployment of electro-dynamic tether via limited tension and current [J].

Chen, Shumin ;

Li, Aijun ;

Wang, Changqing ;

Liu, Chenguang .

ACTA ASTRONAUTICA, 2020, 177 :842-852

[4] Global Tracking Control of Underactuated Ships With Input and Velocity Constraints Using Dynamic Surface Control Method [J].

Chwa, Dongkyoung .

IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2011, 19 (06) :1357-1370

[5] Finite-time control strategy for the running of a telescopic leg biped robot [J].

Doosti, Pouya ;

Mahjoob, M. J. ;

Dadashzadeh, B. .

JOURNAL OF THE BRAZILIAN SOCIETY OF MECHANICAL SCIENCES AND ENGINEERING, 2019, 41 (04)

[6] Robust Adaptive Super-Twisting Sliding Mode Stability Control of Underactuated Rotational Inverted Pendulum With Experimental Validation [J].

El-Sousy, Fayez F. M. ;

Alattas, Khalid A. ;

Mofid, Omid ;

Mobayen, Saleh ;

Fekih, Afef .

IEEE ACCESS, 2022, 10 :100857-100866

[7] Robust Adaptive Dynamic Programming of Two-Player Zero-Sum Games for Continuous-Time Linear Systems [J].

Fu, Yue ;

Fu, Jun ;

Chai, Tianyou .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (12) :3314-3319

[8] Learning-Based Trajectory Tracking and Balance Control for Bicycle Robots With a Pendulum: A Gaussian Process Approach [J].

He, Kanghui ;

Deng, Yang ;

Wang, Guanghan ;

Sun, Xiangyu ;

Sun, Yiyong ;

Chen, Zhang .

IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2022, 27 (02) :634-644

[9] Dexterous Tethered Space Robot: Design, Measurement, Control, and Experiment [J].

Huang, Panfeng ;

Zhang, Fan ;

Cai, Jia ;

Wang, Dongke ;

Meng, Zhongjie ;

Guo, Jian .

IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2017, 53 (03) :1452-1468

[10] Finite-time control of underactuated spacecraft hovering [J].

Huang, Xu ;

Yan, Ye ;

Huang, Zherui .

CONTROL ENGINEERING PRACTICE, 2017, 68 :46-62

← 1 2 3 4 →