共 34 条
Reinforcement learning-based finite time control for the asymmetric underactuated tethered spacecraft with disturbances
被引:3
作者:

Lu, Yingbo
论文数: 0 引用数: 0
h-index: 0
机构:
Zhengzhou Univ Light Ind, Sch Elect Informat Engn, Zhengzhou 450001, Peoples R China Zhengzhou Univ Light Ind, Sch Elect Informat Engn, Zhengzhou 450001, Peoples R China

Wang, Xingyu
论文数: 0 引用数: 0
h-index: 0
机构:
Zhengzhou Univ Light Ind, Sch Elect Informat Engn, Zhengzhou 450001, Peoples R China Zhengzhou Univ Light Ind, Sch Elect Informat Engn, Zhengzhou 450001, Peoples R China

Liu, Ya
论文数: 0 引用数: 0
h-index: 0
机构:
Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China Zhengzhou Univ Light Ind, Sch Elect Informat Engn, Zhengzhou 450001, Peoples R China

Huang, Panfeng
论文数: 0 引用数: 0
h-index: 0
机构:
Northwestern Polytech Univ, Res Ctr Intelligent Robot, Sch Astronaut, Xian 710072, Peoples R China Zhengzhou Univ Light Ind, Sch Elect Informat Engn, Zhengzhou 450001, Peoples R China
机构:
[1] Zhengzhou Univ Light Ind, Sch Elect Informat Engn, Zhengzhou 450001, Peoples R China
[2] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China
[3] Northwestern Polytech Univ, Res Ctr Intelligent Robot, Sch Astronaut, Xian 710072, Peoples R China
基金:
中国国家自然科学基金;
关键词:
Asymmetric underactuated tethered spacecraft;
Reinforcement learning;
Finite time control;
Actor-critic;
STABILITY;
SYSTEMS;
DESIGN;
D O I:
10.1016/j.actaastro.2024.04.014
中图分类号:
V [航空、航天];
学科分类号:
08 ;
0825 ;
摘要:
This article addresses an attitude stabilization control problem for the asymmetric underactuated tethered spacecraft subject to external disturbances, and a reinforcement learning(RL)-based finite time control scheme is proposed to enhance the control performance and energy efficiency of the closed-loop system. Firstly, the error dynamics of the underactuated tethered system in the presence of external disturbances is built based on the Lagrange's modeling technique. Then, a RL-based control algorithm is implemented by a radial basis function (RBF) neural network (NN), in which the actor-critic networks are developed to obtain the optimal performance index function and the optimal controller. According to the Lyapunov theorem, semi-global finite- time stability of all the closed-loop signals is achieved through rigorous mathematical analysis, and tracking errors can be ensured to an arbitrarily small neighborhood of the origin in a finite time. Finally, comparative simulation results with hierarchical sliding mode controller are presented to demonstrate the viability of the proposed strategy.
引用
收藏
页码:218 / 229
页数:12
相关论文
共 34 条
[1]
Reinforcement Learning-Based Fixed-Time Trajectory Tracking Control for Uncertain Robotic Manipulators With Input Saturation
[J].
Cao, Shengjie
;
Sun, Liang
;
Jiang, Jingjing
;
Zuo, Zongyu
.
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS,
2023, 34 (08)
:4584-4595

Cao, Shengjie
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Key Lab Knowledge Automat Ind Proc, Minist Educ, Beijing 100083, Peoples R China
Univ Sci & Technol Beijing, Inst Artificial Intelligence, Beijing 100083, Peoples R China Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Key Lab Knowledge Automat Ind Proc, Minist Educ, Beijing 100083, Peoples R China

Sun, Liang
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Key Lab Knowledge Automat Ind Proc, Minist Educ, Beijing 100083, Peoples R China
Univ Sci & Technol Beijing, Inst Artificial Intelligence, Beijing 100083, Peoples R China Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Key Lab Knowledge Automat Ind Proc, Minist Educ, Beijing 100083, Peoples R China

Jiang, Jingjing
论文数: 0 引用数: 0
h-index: 0
机构:
Loughborough Univ, Dept Aeronaut & Automot Engn, Loughborough LE11 3TU, Leics, England Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Key Lab Knowledge Automat Ind Proc, Minist Educ, Beijing 100083, Peoples R China

Zuo, Zongyu
论文数: 0 引用数: 0
h-index: 0
机构:
Beihang Univ, Res Div 7, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Key Lab Knowledge Automat Ind Proc, Minist Educ, Beijing 100083, Peoples R China
[2]
Adaptive Optimal Tracking Control of an Underactuated Surface Vessel Using Actor-Critic Reinforcement Learning
[J].
Chen, Lin
;
Dai, Shi-Lu
;
Dong, Chao
.
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS,
2024, 35 (06)
:7520-7533

Chen, Lin
论文数: 0 引用数: 0
h-index: 0
机构:
South China Univ Technol, Sch Automat Sci & Engn, Guangzhou 510641, Peoples R China South China Univ Technol, Sch Automat Sci & Engn, Guangzhou 510641, Peoples R China

Dai, Shi-Lu
论文数: 0 引用数: 0
h-index: 0
机构:
South China Univ Technol, Sch Automat Sci & Engn, Key Lab Autonomous Syst & Networked Control, Minist Educ, Guangzhou 510641, Peoples R China
South China Univ Technol, Unmanned Aerial Vehicle Syst Engn Technol Res Ctr, Guangzhou 510641, Peoples R China
Southern Marine Sci & Engn Guangdong Lab Zhuhai, Zhuhai 519000, Peoples R China South China Univ Technol, Sch Automat Sci & Engn, Guangzhou 510641, Peoples R China

Dong, Chao
论文数: 0 引用数: 0
h-index: 0
机构:
Southern Marine Sci & Engn Guangdong Lab Zhuhai, Zhuhai 519000, Peoples R China
South China Sea Marine Survey & Technol Ctr, Guangzhou 510300, Peoples R China
Minist Nat Resources, Key Lab Marine Environm Survey Technol & Applicat, Guangzhou 510300, Peoples R China South China Univ Technol, Sch Automat Sci & Engn, Guangzhou 510641, Peoples R China
[3]
Adaptive sliding mode control for deployment of electro-dynamic tether via limited tension and current
[J].
Chen, Shumin
;
Li, Aijun
;
Wang, Changqing
;
Liu, Chenguang
.
ACTA ASTRONAUTICA,
2020, 177
:842-852

Chen, Shumin
论文数: 0 引用数: 0
h-index: 0
机构:
Northwestern Polytech Univ, Sch Automat, 1 Dongxiang Rd, Xian 710129, Shaanxi, Peoples R China Northwestern Polytech Univ, Sch Automat, 1 Dongxiang Rd, Xian 710129, Shaanxi, Peoples R China

Li, Aijun
论文数: 0 引用数: 0
h-index: 0
机构:
Northwestern Polytech Univ, Sch Automat, 1 Dongxiang Rd, Xian 710129, Shaanxi, Peoples R China Northwestern Polytech Univ, Sch Automat, 1 Dongxiang Rd, Xian 710129, Shaanxi, Peoples R China

Wang, Changqing
论文数: 0 引用数: 0
h-index: 0
机构:
Northwestern Polytech Univ, Sch Automat, 1 Dongxiang Rd, Xian 710129, Shaanxi, Peoples R China Northwestern Polytech Univ, Sch Automat, 1 Dongxiang Rd, Xian 710129, Shaanxi, Peoples R China

Liu, Chenguang
论文数: 0 引用数: 0
h-index: 0
机构:
Northwestern Polytech Univ, Sch Automat, 1 Dongxiang Rd, Xian 710129, Shaanxi, Peoples R China Northwestern Polytech Univ, Sch Automat, 1 Dongxiang Rd, Xian 710129, Shaanxi, Peoples R China
[4]
Global Tracking Control of Underactuated Ships With Input and Velocity Constraints Using Dynamic Surface Control Method
[J].
Chwa, Dongkyoung
.
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY,
2011, 19 (06)
:1357-1370

论文数: 引用数:
h-index:
机构:
[5]
Finite-time control strategy for the running of a telescopic leg biped robot
[J].
Doosti, Pouya
;
Mahjoob, M. J.
;
Dadashzadeh, B.
.
JOURNAL OF THE BRAZILIAN SOCIETY OF MECHANICAL SCIENCES AND ENGINEERING,
2019, 41 (04)

Doosti, Pouya
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Tehran, Coll Engn, Sch Mech Engn, Tehran, Iran Univ Tehran, Coll Engn, Sch Mech Engn, Tehran, Iran

Mahjoob, M. J.
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Tehran, Coll Engn, Sch Mech Engn, Tehran, Iran Univ Tehran, Coll Engn, Sch Mech Engn, Tehran, Iran

Dadashzadeh, B.
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Tabriz, Sch Engn Emerging Technol, Mechatron Engn Dept, Tabriz, Iran Univ Tehran, Coll Engn, Sch Mech Engn, Tehran, Iran
[6]
Robust Adaptive Super-Twisting Sliding Mode Stability Control of Underactuated Rotational Inverted Pendulum With Experimental Validation
[J].
El-Sousy, Fayez F. M.
;
Alattas, Khalid A.
;
Mofid, Omid
;
Mobayen, Saleh
;
Fekih, Afef
.
IEEE ACCESS,
2022, 10
:100857-100866

El-Sousy, Fayez F. M.
论文数: 0 引用数: 0
h-index: 0
机构:
Prince Sattam Bin Abdulaziz Univ, Coll Engn, Dept Elect Engn, Al Kharj 16278, Saudi Arabia Prince Sattam Bin Abdulaziz Univ, Coll Engn, Dept Elect Engn, Al Kharj 16278, Saudi Arabia

Alattas, Khalid A.
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Jeddah, Coll Comp Sci & Engn, Dept Comp Sci & Artificial Intelligence, Jeddah 23218, Saudi Arabia Prince Sattam Bin Abdulaziz Univ, Coll Engn, Dept Elect Engn, Al Kharj 16278, Saudi Arabia

Mofid, Omid
论文数: 0 引用数: 0
h-index: 0
机构:
Natl Yunlin Univ Sci & Technol, Future Technol Res Ctr, Touliu 64002, Yunlin, Taiwan Prince Sattam Bin Abdulaziz Univ, Coll Engn, Dept Elect Engn, Al Kharj 16278, Saudi Arabia

Mobayen, Saleh
论文数: 0 引用数: 0
h-index: 0
机构:
Natl Yunlin Univ Sci & Technol, Future Technol Res Ctr, Touliu 64002, Yunlin, Taiwan Prince Sattam Bin Abdulaziz Univ, Coll Engn, Dept Elect Engn, Al Kharj 16278, Saudi Arabia

Fekih, Afef
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Louisiana Lafayette, Dept Elect & Comp Engn, Lafayette, LA 70504 USA Prince Sattam Bin Abdulaziz Univ, Coll Engn, Dept Elect Engn, Al Kharj 16278, Saudi Arabia
[7]
Robust Adaptive Dynamic Programming of Two-Player Zero-Sum Games for Continuous-Time Linear Systems
[J].
Fu, Yue
;
Fu, Jun
;
Chai, Tianyou
.
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS,
2015, 26 (12)
:3314-3319

Fu, Yue
论文数: 0 引用数: 0
h-index: 0
机构:
Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang 110819, Peoples R China Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang 110819, Peoples R China

Fu, Jun
论文数: 0 引用数: 0
h-index: 0
机构:
Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang 110819, Peoples R China Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang 110819, Peoples R China

Chai, Tianyou
论文数: 0 引用数: 0
h-index: 0
机构:
Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang 110819, Peoples R China Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang 110819, Peoples R China
[8]
Learning-Based Trajectory Tracking and Balance Control for Bicycle Robots With a Pendulum: A Gaussian Process Approach
[J].
He, Kanghui
;
Deng, Yang
;
Wang, Guanghan
;
Sun, Xiangyu
;
Sun, Yiyong
;
Chen, Zhang
.
IEEE-ASME TRANSACTIONS ON MECHATRONICS,
2022, 27 (02)
:634-644

He, Kanghui
论文数: 0 引用数: 0
h-index: 0
机构:
Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China

Deng, Yang
论文数: 0 引用数: 0
h-index: 0
机构:
Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China

Wang, Guanghan
论文数: 0 引用数: 0
h-index: 0
机构:
Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China

Sun, Xiangyu
论文数: 0 引用数: 0
h-index: 0
机构:
Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China

Sun, Yiyong
论文数: 0 引用数: 0
h-index: 0
机构:
Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China

Chen, Zhang
论文数: 0 引用数: 0
h-index: 0
机构:
Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
[9]
Dexterous Tethered Space Robot: Design, Measurement, Control, and Experiment
[J].
Huang, Panfeng
;
Zhang, Fan
;
Cai, Jia
;
Wang, Dongke
;
Meng, Zhongjie
;
Guo, Jian
.
IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS,
2017, 53 (03)
:1452-1468

Huang, Panfeng
论文数: 0 引用数: 0
h-index: 0
机构:
Northwestern Polytech Univ, Natl Key Lab Aerosp Flight Dynam, Sch Astronaut, Xian 710072, Peoples R China
Northwestern Polytech Univ, Res Ctr Intelligent Robot, Sch Astronaut, Xian 710072, Peoples R China Northwestern Polytech Univ, Natl Key Lab Aerosp Flight Dynam, Sch Astronaut, Xian 710072, Peoples R China

Zhang, Fan
论文数: 0 引用数: 0
h-index: 0
机构:
Northwestern Polytech Univ, Natl Key Lab Aerosp Flight Dynam, Sch Astronaut, Xian 710072, Peoples R China
Northwestern Polytech Univ, Res Ctr Intelligent Robot, Sch Astronaut, Xian 710072, Peoples R China Northwestern Polytech Univ, Natl Key Lab Aerosp Flight Dynam, Sch Astronaut, Xian 710072, Peoples R China

Cai, Jia
论文数: 0 引用数: 0
h-index: 0
机构:
Northwestern Polytech Univ, Natl Key Lab Aerosp Flight Dynam, Sch Astronaut, Xian 710072, Peoples R China
Northwestern Polytech Univ, Res Ctr Intelligent Robot, Sch Astronaut, Xian 710072, Peoples R China Northwestern Polytech Univ, Natl Key Lab Aerosp Flight Dynam, Sch Astronaut, Xian 710072, Peoples R China

Wang, Dongke
论文数: 0 引用数: 0
h-index: 0
机构:
Northwestern Polytech Univ, Natl Key Lab Aerosp Flight Dynam, Sch Astronaut, Xian 710072, Peoples R China
Northwestern Polytech Univ, Res Ctr Intelligent Robot, Sch Astronaut, Xian 710072, Peoples R China Northwestern Polytech Univ, Natl Key Lab Aerosp Flight Dynam, Sch Astronaut, Xian 710072, Peoples R China

Meng, Zhongjie
论文数: 0 引用数: 0
h-index: 0
机构:
Northwestern Polytech Univ, Natl Key Lab Aerosp Flight Dynam, Sch Astronaut, Xian 710072, Peoples R China
Northwestern Polytech Univ, Res Ctr Intelligent Robot, Sch Astronaut, Xian 710072, Peoples R China Northwestern Polytech Univ, Natl Key Lab Aerosp Flight Dynam, Sch Astronaut, Xian 710072, Peoples R China

Guo, Jian
论文数: 0 引用数: 0
h-index: 0
机构:
Delft Univ Technol, Delft, Netherlands Northwestern Polytech Univ, Natl Key Lab Aerosp Flight Dynam, Sch Astronaut, Xian 710072, Peoples R China
[10]
Finite-time control of underactuated spacecraft hovering
[J].
Huang, Xu
;
Yan, Ye
;
Huang, Zherui
.
CONTROL ENGINEERING PRACTICE,
2017, 68
:46-62

Huang, Xu
论文数: 0 引用数: 0
h-index: 0
机构:
Natl Univ Def Technol, Coll Aerosp Sci & Engn, Changsha 410073, Hunan, Peoples R China Natl Univ Def Technol, Coll Aerosp Sci & Engn, Changsha 410073, Hunan, Peoples R China

Yan, Ye
论文数: 0 引用数: 0
h-index: 0
机构:
Natl Univ Def Technol, Coll Aerosp Sci & Engn, Changsha 410073, Hunan, Peoples R China Natl Univ Def Technol, Coll Aerosp Sci & Engn, Changsha 410073, Hunan, Peoples R China

Huang, Zherui
论文数: 0 引用数: 0
h-index: 0
机构:
Nanjing Univ, Dept Math, Nanjing 210093, Jiangsu, Peoples R China Natl Univ Def Technol, Coll Aerosp Sci & Engn, Changsha 410073, Hunan, Peoples R China