Cascade control of underactuated manipulator based on reinforcement learning framework

被引：0

作者：

Jiang, Naijing ^{[1
,2
]}

Guo, Dingxu ^{[1
]}

Zhang, Shu ^{[1
]}

Zhang, Dan ^{[3
]}

Xu, Jian ^{[1
]}

机构：

[1] Tongji Univ, Sch Aerosp Engn & Appl Mech, 1239 Siping Rd, Shanghai 200092, Peoples R China

[2] Shanghai Microport Medbot, Shanghai, Peoples R China

[3] York Univ, Lassonde Sch Engn, Toronto, ON, Canada

来源：

PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART I-JOURNAL OF SYSTEMS AND CONTROL ENGINEERING | 2023年 / 237卷 / 02期

基金：

美国国家科学基金会; 中国国家自然科学基金;

关键词：

Underactuation; rest-to-rest motion; reinforcement learning; path planning; VIBRATION CONTROL; SYSTEM; DESIGN; MOTION;

D O I：

10.1177/09596518221125533

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this article, we propose a cascade control framework to attenuate the residual vibration of the underactuated manipulator. The control framework is divided into two phases. In the first phase, a path generator trained by the reinforcement learning produces the leading signal for the tracking controller. In the second phase, the leading signal stabilizes the underactuated manipulator, and the adaptive proportional derivative controller is implemented to reduce the vibration. In the process, a novel path planning method is proposed to improve exploration efficiency, and a negative reward is introduced to avoid unsafe strategies and simulation instability. The effectiveness of the proposed control scheme is verified in the simulations of the double pendulum crane and the two-link flexible manipulator.

引用

页码：231 / 243

页数：13

共 50 条

[1] Trajectory planning for flexible Cartesian robot manipulator by using artificial neural network: numerical simulation and experimental verification [J].

Abe, Akira .

ROBOTICA, 2011, 29 :797-804

[2] Faster Motion on Cartesian Paths Exploiting Robot Redundancy at the Acceleration Level [J].

Al Khudir, Khaled ;

De Luca, Alessandro .

IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (04) :3553-3560

[3] Deep Reinforcement Learning A brief survey [J].

Arulkumaran, Kai ;

Deisenroth, Marc Peter ;

Brundage, Miles ;

Bharath, Anil Anthony .

IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) :26-38

[4] Design and Optimal Control of an Underactuated Cable-Driven Micro-Macro Robot [J].

Barbazza, Luca ;

Zanotto, Damiano ;

Rosati, Giulio ;

Agrawal, Sunil K. .

IEEE ROBOTICS AND AUTOMATION LETTERS, 2017, 2 (02) :896-903

[5] Robust point-to-point trajectory planning for nonlinear underactuated systems: Theory and experimental assessment [J].

Boscariol, Paolo ;

Richiedei, Dario .

ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2018, 50 :256-265

[6] Trajectory Modified in Joint Space for Vibration Suppression of Manipulator [J].

Cao, Baoshi ;

Sun, Kui ;

Li, Tian ;

Gu, Yikun ;

Jin, Minghe ;

Liu, Hong .

IEEE ACCESS, 2018, 6 :57969-57980

[7] A Direct Method of Adaptive FIR Input Shaping for Motion Control With Zero Residual Vibration [J].

Cole, Matthew O. T. ;

Wongratanaphisan, Theeraphong .

IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2013, 18 (01) :316-327

[8]

Cong W., 2018, J DYN SYST MEAS CONT, V140

[9]

Fan CZ, 2016, 2016 IEEE INTERNATIONAL CONFERENCE ON REAL-TIME COMPUTING AND ROBOTICS (IEEE RCAR), P489, DOI 10.1109/RCAR.2016.7784078

[10] Discovery of the maximum principle [J].

Gamkrelidze R.V. .

Journal of Dynamical and Control Systems, 1999, 5 (4) :437-451

← 1 2 3 4 5 →