Real-Time Adaptive Control of a Flexible Manipulator Using Reinforcement Learning

被引:85
作者
Pradhan, Santanu Kumar [1 ]
Subudhi, Bidyadhar [1 ]
机构
[1] Natl Inst Technol, Dept Elect Engn, Rourkela 769008, Orissa, India
关键词
Adaptive control; flexible-link manipulator; reinforcement learning; tip trajectory tracking; LINK;
D O I
10.1109/TASE.2012.2189004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper exploits reinforcement learning (RL) for developing real-time adaptive control of tip trajectory and deflection of a two-link flexible manipulator handling variable payloads. This proposed adaptive controller consists of a proportional derivative (PD) tracking loop and an actor-critic-based RL loop that adapts the actor and critic weights in response to payload variations while suppressing the tip deflection and tracking the desired trajectory. The actor-critic-based RL loop uses a recursive least square (RLS)-based temporal difference (TD) learning with eligibility trace and an adaptive memory to estimate the critic weights and a gradient-based estimator for estimating actor weights. Tip trajectory tracking and suppression of tip deflection performances of the proposed RL-based adaptive controller (RLAC) are compared with that of a nonlinear regression-based direct adaptive controller (DAC) and a fuzzy learning-based adaptive controller (FLAC). Simulation and experimental results envisage that the RLAC outperforms both the DAC and FLAC. Note to Practitioners-This paper shows how to control a system with distributed flexibility. The reinforcement learning approach to develop adaptive control described in the paper can be applied to control also complex flexible space shuttle system and for damping of many vibratory systems.
引用
收藏
页码:237 / 249
页数:13
相关论文
共 12 条
[1]  
Bradtke SJ, 1996, MACH LEARN, V22, P33, DOI 10.1007/BF00114723
[2]   CLOSED-FORM DYNAMIC-MODEL OF PLANAR MULTILINK LIGHTWEIGHT ROBOTS [J].
DELUCA, A ;
SICILIANO, B .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1991, 21 (04) :826-839
[3]   Adaptive Control of a Single-Link Flexible Manipulator [J].
Feliu, Vincente ;
Rattan, Kuldip S. ;
Brown, H. Benjamin, Jr. .
IEEE CONTROL SYSTEMS MAGAZINE, 1990, 10 (02) :29-33
[4]   Implementation of a neural network tracking controller for a single flexible link: Comparison with PD and PID controllers [J].
Gutierrez, LB ;
Lewis, FL ;
Lowe, JA .
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 1998, 45 (02) :307-318
[5]   FUZZY LEARNING CONTROL FOR A FLEXIBLE-LINK ROBOT [J].
MOUDGAL, VG ;
KWONG, WA ;
PASSINO, KM ;
YURKOVICH, S .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 1995, 3 (02) :199-210
[6]   Reinforcement Learning With Function Approximation for Traffic Signal Control [J].
Prashanth, L. A. ;
Bhatnagar, Shalabh .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2011, 12 (02) :412-421
[7]   Experimental results on discrete-time nonlinear adaptive tracking control of a flexible-link manipulator [J].
Rokui, MR ;
Khorasani, K .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2000, 30 (01) :151-164
[8]   ADAPTIVE MANIPULATOR CONTROL - A CASE-STUDY [J].
SLOTINE, JJE ;
LI, WP .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1988, 33 (11) :995-1003
[9]   Soft computing methods applied to the control of a flexible robot manipulator [J].
Subudhi, B. ;
Morris, A. S. .
APPLIED SOFT COMPUTING, 2009, 9 (01) :149-158
[10]  
Sutton R.S., 2017, Introduction to reinforcement learning