Reinforcement learning control of a single-link flexible robotic manipulator

被引:70
作者
Ouyang, Yuncheng [1 ,2 ]
He, Wei [3 ]
Li, Xiajing [4 ]
机构
[1] Univ Elect Sci & Technol China, Sch Automat Engn, Chengdu 611731, Peoples R China
[2] Univ Elect Sci & Technol China, Ctr Robot, Chengdu 611731, Peoples R China
[3] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China
[4] Delft Univ Technol, Delft Ctr Syst & Control, NL-2628 CD Delft, Netherlands
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
flexible manipulators; learning (artificial intelligence); manipulator kinematics; vibration control; lightweight structures; radial basis function networks; stability; Lyapunov methods; reinforcement learning control; single-link flexible robotic manipulator; lightweight structure; Lagrange equation; radial basis function neural networks; actor NN; system stability; Lyapunov direct method; Matlab simulation; Quanser flexible link platform; ADAPTIVE TRACKING CONTROL; VIBRATION CONTROL; FEEDBACK-CONTROL; STABILITY ANALYSIS; NONLINEAR-SYSTEMS; RESONANT CONTROL; CONSTRAINT; DESIGN; PARAMETERS; NETWORKS;
D O I
10.1049/iet-cta.2016.1540
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this study, the authors focus on the reinforcement learning control of a single-link flexible manipulator and attempt to suppress the vibration due to its flexibility and lightweight structure. The assumed mode method and the Lagrange's equation are adopted in modelling to enhance the satisfaction of precision. Two radial basis function neural networks (NNs) are employed in the designed control algorithm, actor NN for generating a policy and critic NN for evaluating the cost-to-go. Rigorous stability of the system has been proven via Lyapunov's direct method. Through Matlab simulation and experiment on the Quanser flexible link platform, the superiority and feasibility of the reinforcement learning control are verified.
引用
收藏
页码:1426 / 1433
页数:8
相关论文
共 50 条
[1]  
Busoniu L, 2010, AUTOM CONTROL ENG SE, P1, DOI 10.1201/9781439821091-f
[2]   Linear PID composite controller and its tuning for flexible link robots [J].
Cheong, Joono ;
Lee, Seungjin .
JOURNAL OF VIBRATION AND CONTROL, 2008, 14 (03) :291-318
[3]  
Clough R., 1975, Dynamics of Structures
[4]   Discrete time sliding mode control of robotic manipulators: Development and experimental validation [J].
Corradini, Maria Letizia ;
Fossi, Valentino ;
Giantomassi, Andrea ;
Ippoliti, Gianluca ;
Longhi, Sauro ;
Orlando, Giuseppe .
CONTROL ENGINEERING PRACTICE, 2012, 20 (08) :816-822
[5]   Reinforcement learning neural-network-based controller for nonlinear discrete-time systems with input constraints [J].
He, Pingan ;
Jagannathan, S. .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2007, 37 (02) :425-436
[6]   Adaptive Control of a Flexible String System With Input Hysteresis [J].
He, Wei ;
Meng, Tingting .
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2018, 26 (02) :693-700
[7]   Adaptive Boundary Iterative Learning Control for an Euler-Bernoulli Beam System With Input Constraint [J].
He, Wei ;
Meng, Tingting ;
Huang, Deqing ;
Li, Xuefang .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (05) :1539-1549
[8]   Modeling and Vibration Control for a Moving Beam With Application in a Drilling Riser [J].
He, Wei ;
Nie, Shuangxi ;
Meng, Tingting ;
Liu, Yan-Jun .
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2017, 25 (03) :1036-1043
[9]   Vibration Control of a Flexible Robotic Manipulator in the Presence of Input Deadzone [J].
He, Wei ;
Ouyang, Yuncheng ;
Hong, Jie .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2017, 13 (01) :48-59
[10]   Model Identification and Control Design for a Humanoid Robot [J].
He, Wei ;
Ge, Weiliang ;
Li, Yunchuan ;
Liu, Yan-Jun ;
Yang, Chenguang ;
Sun, Changyin .
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2017, 47 (01) :45-57