Optimal robust online tracking control for space manipulator in task space using off-policy reinforcement learning

被引:1
作者
Zhuang, Hongji [1 ]
Zhou, Hang [1 ]
Shen, Qiang [1 ]
Wu, Shufan [1 ,2 ]
Razoumny, Vladimir Yu. [2 ]
Razoumny, Yury N. [2 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai 200240, Peoples R China
[2] Peoples Friendship Univ Russia, RUDN Univ, Moscow 117198, Russia
基金
中国国家自然科学基金;
关键词
Task space; Reinforcement learning; Online tracking control; H-INFINITY CONTROL; ROBOT MANIPULATORS; NONLINEAR-SYSTEMS; TIME-SYSTEMS; APPROXIMATION; STATE;
D O I
10.1016/j.ast.2024.109446
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
This study addresses the demands for adaptability, uncertainty management, and high performance in the control of space manipulators, and the inadequacies in achieving optimal control and handling external uncertainty in task space in previous research. Based on off-policy reinforcement learning, a model-free and time-efficient method for online robust tracking control in task space is devised. To address the complexity of dynamic equations in task space, a mixed-variable approach is adopted to transform the multivariable coupled time- varying problem into a single-variable problem. Subsequently, the optimal control policy is derived with the disturbance convergence, stability, and optimality of the control method being demonstrated. This marks the first instance of achieving robust optimal tracking control in task space for space manipulators. The efficacy and superiority of the presented algorithm are validated through simulation.
引用
收藏
页数:12
相关论文
共 60 条
[11]   Adaptive optimal formation control for unmanned surface vehicles with guaranteed performance using actor-critic learning architecture [J].
Chen, Lin ;
Dong, Chao ;
He, Shude ;
Dai, Shi-Lu .
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2023, 33 (08) :4504-4522
[12]   Adaptive Optimal Tracking Control of an Underactuated Surface Vessel Using Actor-Critic Reinforcement Learning [J].
Chen, Lin ;
Dai, Shi-Lu ;
Dong, Chao .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (06) :7520-7533
[13]   Optimal tracking control for unknown nonlinear systems with uncertain input saturation: A dynamic event-triggered ADP algorithm [J].
Chen, Lu ;
Hao, Fei .
NEUROCOMPUTING, 2024, 564
[14]   Value Iteration-Based Adaptive Fuzzy Backstepping Optimal Control of Modular Robot Manipulators via Integral Reinforcement Learning [J].
Dong, Bo ;
Jiang, Hucheng ;
Cui, Yiming ;
Zhu, Xinye ;
An, Tianjiao .
INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2024, 26 (04) :1347-1363
[15]   An adaptive continuous sliding mode feedback linearization task space control for robot manipulators [J].
Elmogy, Ahmed ;
Elawady, Wael .
AIN SHAMS ENGINEERING JOURNAL, 2024, 15 (01)
[16]   Model-free adaptive task-space sliding mode control of a Delta robot using a novel reaching law [J].
Fateh, Alireza ;
Momeni, Hamidreza .
ISA TRANSACTIONS, 2024, 149 :69-80
[17]   UNIVERSAL APPROXIMATION OF AN UNKNOWN MAPPING AND ITS DERIVATIVES USING MULTILAYER FEEDFORWARD NETWORKS [J].
HORNIK, K ;
STINCHCOMBE, M ;
WHITE, H .
NEURAL NETWORKS, 1990, 3 (05) :551-560
[18]   Adaptive backstepping trajectory tracking control of robot manipulator [J].
Hu, Qinglei ;
Xu, Liang ;
Zhang, Aihua .
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2012, 349 (03) :1087-1105
[19]   A tutorial on visual servo control [J].
Hutchinson, S ;
Hager, GD ;
Corke, PI .
IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, 1996, 12 (05) :651-670
[20]  
Jeffreys H., 1999, METHODS MATH PHYS