Control of a piecewise constant curvature continuum manipulator via policy search method

被引：0

作者：

Huang, Sheng ^{[1
]}

Zhang, Qiyuan ^{[1
]}

Liu, Zhaoyang ^{[2
]}

Wang, Xueqian ^{[1
]}

Liang, Bin ^{[2
,3
]}

机构：

[1] Tsinghua Univ, Ctr Artificial Intelligence & Robot, Grad Sch Shenzhen, Shenzhen 518055, Peoples R China

[2] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China

[3] Tsinghua Univ Shenzhen, Res Inst, Shenzhen 518057, Peoples R China

来源：

2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO) | 2018年

基金：

中国国家自然科学基金;

关键词：

Piecewise constant curvature; continuum robot; reinforcement learning; guided policy search; DYNAMICS; ROBOTS;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Accurate dynamic models are necessary for fast and precise motion control of continuum manipulators, but can hardly be obtained. In this paper, we propose a method for precise position control of piecewise constant curvature (PCC) continuum manipulators using the guided policy search method. The policy network is trained to control the manipulator to reach target position precisely. Firstly, a linear time-varying Gaussian model fitting to sample trajectories is used to approximate the dynamics of the PCC continuum manipulator. Then, the Gaussian controller that induces the trajectory distribution is optimized by iterative LQR based on the fitted model. Finally, the policy is updated by minimizing the KL-divergence between the policy and trajectory distribution. We have simulated the PCC continuum manipulator with two-segment model and three-segment model, respectively. The results show that the linear time-varying Gaussian model can describe the dynamics of the PCC continuum manipulator and the proposed method can achieve desired performance of position control within a couple of iterations.

引用

页码：1777 / 1782

页数：6

共 6 条

[1] Kinematic Modeling of Handed Shearing Auxetics via Piecewise Constant Curvature
Garg, Aman
Good, Ian
Revier, Daniel
Airis, Kevin
Lipton, Jeffrey
2022 IEEE 5TH INTERNATIONAL CONFERENCE ON SOFT ROBOTICS (ROBOSOFT), 2022, : 423 - 430
[2] Optimizing a Continuum Manipulator's Search Policy Through Model-Free Reinforcement Learning
Frazelle, Chase
Rogers, Jonathan
Karamouzas, Ioannis
Walker, Ian
2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5564 - 5571
[3] Modeling and Simulation of Passive Catheter based on Piecewise Constant Curvature Method
Wang, Yuan
Guo, Shuxiang
Xiao, Nan
2014 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (IEEE ICMA 2014), 2014, : 958 - 963
[4] Bayesian Learning for Policy Search in Trajectory Control of a Planar Manipulator
Aghaei, Vahid Tavakol
Agababaoglu, Arda
Onat, Ahmet
Yildirim, Sinan
2019 IEEE 9TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2019, : 240 - 246
[5] Static modeling and analysis of soft manipulator considering environment contact based on segmented constant curvature method
Chen, Yinglong
Li, Wenshuo
Gong, Yongjun
INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2021, 48 (02): : 233 - 246
[6] Combining Soft-Actor Critic with Cross-Entropy Method for Policy Search in Continuous Control
Hieu Trung Nguyen
Khang Tran
Ngoc Hoang Luong
2022 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2022,

← 1 →