Reinforcement learning control of a single-link flexible robotic manipulator

被引：70

作者：

Ouyang, Yuncheng ^{[1
,2
]}

He, Wei ^{[3
]}

Li, Xiajing ^{[4
]}

机构：

[1] Univ Elect Sci & Technol China, Sch Automat Engn, Chengdu 611731, Peoples R China

[2] Univ Elect Sci & Technol China, Ctr Robot, Chengdu 611731, Peoples R China

[3] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China

[4] Delft Univ Technol, Delft Ctr Syst & Control, NL-2628 CD Delft, Netherlands

来源：

IET CONTROL THEORY AND APPLICATIONS | 2017年 / 11卷 / 09期

基金：

中国国家自然科学基金; 北京市自然科学基金;

关键词：

flexible manipulators; learning (artificial intelligence); manipulator kinematics; vibration control; lightweight structures; radial basis function networks; stability; Lyapunov methods; reinforcement learning control; single-link flexible robotic manipulator; lightweight structure; Lagrange equation; radial basis function neural networks; actor NN; system stability; Lyapunov direct method; Matlab simulation; Quanser flexible link platform; ADAPTIVE TRACKING CONTROL; VIBRATION CONTROL; FEEDBACK-CONTROL; STABILITY ANALYSIS; NONLINEAR-SYSTEMS; RESONANT CONTROL; CONSTRAINT; DESIGN; PARAMETERS; NETWORKS;

D O I：

10.1049/iet-cta.2016.1540

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this study, the authors focus on the reinforcement learning control of a single-link flexible manipulator and attempt to suppress the vibration due to its flexibility and lightweight structure. The assumed mode method and the Lagrange's equation are adopted in modelling to enhance the satisfaction of precision. Two radial basis function neural networks (NNs) are employed in the designed control algorithm, actor NN for generating a policy and critic NN for evaluating the cost-to-go. Rigorous stability of the system has been proven via Lyapunov's direct method. Through Matlab simulation and experiment on the Quanser flexible link platform, the superiority and feasibility of the reinforcement learning control are verified.

引用

页码：1426 / 1433

页数：8

共 50 条

[11] Adaptive Neural Impedance Control of a Robotic Manipulator With Input Saturation [J].

He, Wei ;

Dong, Yiting ;

Sun, Changyin .

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2016, 46 (03) :334-344

[12] Neural Network Control of a Rehabilitation Robot by State and Output Feedback [J].

He, Wei ;

Ge, Shuzhi Sam ;

Li, Yanan ;

Chew, Effie ;

Ng, Yee Sien .

JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2015, 80 (01) :15-31

[13] Modeling and Vibration Control for a Nonlinear Moving String With Output Constraint [J].

He, Wei ;

Ge, Shuzhi Sam ;

Huang, Deqing .

IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2015, 20 (04) :1886-1897

[14] Adaptive Boundary Control of a Nonlinear Flexible String System [J].

He, Wei ;

Zhang, Shuang ;

Ge, Shuzhi Sam .

IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2014, 22 (03) :1088-1093

[15] Reinforcement learning: A survey [J].

Kaelbling, LP ;

Littman, ML ;

Moore, AW .

JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1996, 4 :237-285

[16] On Input-to-State Stability of Switched Stochastic Nonlinear Systems Under Extended Asynchronous Switching [J].

Kang, Yu ;

Zhai, Di-Hua ;

Liu, Guo-Ping ;

Zhao, Yun-Bo .

IEEE TRANSACTIONS ON CYBERNETICS, 2016, 46 (05) :1092-1105

[17] Stability Analysis of A Class of Hybrid Stochastic Retarded Systems Under Asynchronous Switching [J].

Kang, Yu ;

Zhai, Di-Hua ;

Liu, Guo-Ping ;

Zhao, Yun-Bo ;

Zhao, Ping .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2014, 59 (06) :1511-1523

[18] Review of Control and Sensor System of Flexible Manipulator [J].

Kiang, Chang Tai ;

Spowage, Andrew ;

Yoong, Chan Kuan .

JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2015, 77 (01) :187-213

[19] Mathematical modeling and trajectory planning of mobile manipulators with flexible links and joints [J].

Korayem, Moharam Habibnejad ;

Rahimi, H. N. ;

Nikoobin, A. .

APPLIED MATHEMATICAL MODELLING, 2012, 36 (07) :3223-3238

[20] Neural network based hybrid force/position control for robot manipulators [J].

Kumar, Naveen ;

Panwar, Vikas ;

Sukavanam, Nagarajan ;

Sharma, Prakash ;

Borm, Jin-Hwan .

INTERNATIONAL JOURNAL OF PRECISION ENGINEERING AND MANUFACTURING, 2011, 12 (03) :419-426

← 1 2 3 4 5 →