Real-Time Adaptive Control of a Flexible Manipulator Using Reinforcement Learning

被引：85

作者：

Pradhan, Santanu Kumar ^{[1
]}

Subudhi, Bidyadhar ^{[1
]}

机构：

[1] Natl Inst Technol, Dept Elect Engn, Rourkela 769008, Orissa, India

来源：

IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING | 2012年 / 9卷 / 02期

关键词：

Adaptive control; flexible-link manipulator; reinforcement learning; tip trajectory tracking; LINK;

D O I：

10.1109/TASE.2012.2189004

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper exploits reinforcement learning (RL) for developing real-time adaptive control of tip trajectory and deflection of a two-link flexible manipulator handling variable payloads. This proposed adaptive controller consists of a proportional derivative (PD) tracking loop and an actor-critic-based RL loop that adapts the actor and critic weights in response to payload variations while suppressing the tip deflection and tracking the desired trajectory. The actor-critic-based RL loop uses a recursive least square (RLS)-based temporal difference (TD) learning with eligibility trace and an adaptive memory to estimate the critic weights and a gradient-based estimator for estimating actor weights. Tip trajectory tracking and suppression of tip deflection performances of the proposed RL-based adaptive controller (RLAC) are compared with that of a nonlinear regression-based direct adaptive controller (DAC) and a fuzzy learning-based adaptive controller (FLAC). Simulation and experimental results envisage that the RLAC outperforms both the DAC and FLAC. Note to Practitioners-This paper shows how to control a system with distributed flexibility. The reinforcement learning approach to develop adaptive control described in the paper can be applied to control also complex flexible space shuttle system and for damping of many vibratory systems.

引用

页码：237 / 249

页数：13

共 12 条

[1]

Bradtke SJ, 1996, MACH LEARN, V22, P33, DOI 10.1007/BF00114723

[2] CLOSED-FORM DYNAMIC-MODEL OF PLANAR MULTILINK LIGHTWEIGHT ROBOTS [J].