Control of a piecewise constant curvature continuum manipulator via policy search method

被引:0
|
作者
Huang, Sheng [1 ]
Zhang, Qiyuan [1 ]
Liu, Zhaoyang [2 ]
Wang, Xueqian [1 ]
Liang, Bin [2 ,3 ]
机构
[1] Tsinghua Univ, Ctr Artificial Intelligence & Robot, Grad Sch Shenzhen, Shenzhen 518055, Peoples R China
[2] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
[3] Tsinghua Univ Shenzhen, Res Inst, Shenzhen 518057, Peoples R China
来源
2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO) | 2018年
基金
中国国家自然科学基金;
关键词
Piecewise constant curvature; continuum robot; reinforcement learning; guided policy search; DYNAMICS; ROBOTS;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Accurate dynamic models are necessary for fast and precise motion control of continuum manipulators, but can hardly be obtained. In this paper, we propose a method for precise position control of piecewise constant curvature (PCC) continuum manipulators using the guided policy search method. The policy network is trained to control the manipulator to reach target position precisely. Firstly, a linear time-varying Gaussian model fitting to sample trajectories is used to approximate the dynamics of the PCC continuum manipulator. Then, the Gaussian controller that induces the trajectory distribution is optimized by iterative LQR based on the fitted model. Finally, the policy is updated by minimizing the KL-divergence between the policy and trajectory distribution. We have simulated the PCC continuum manipulator with two-segment model and three-segment model, respectively. The results show that the linear time-varying Gaussian model can describe the dynamics of the PCC continuum manipulator and the proposed method can achieve desired performance of position control within a couple of iterations.
引用
收藏
页码:1777 / 1782
页数:6
相关论文
共 6 条
  • [1] Kinematic Modeling of Handed Shearing Auxetics via Piecewise Constant Curvature
    Garg, Aman
    Good, Ian
    Revier, Daniel
    Airis, Kevin
    Lipton, Jeffrey
    2022 IEEE 5TH INTERNATIONAL CONFERENCE ON SOFT ROBOTICS (ROBOSOFT), 2022, : 423 - 430
  • [2] Optimizing a Continuum Manipulator's Search Policy Through Model-Free Reinforcement Learning
    Frazelle, Chase
    Rogers, Jonathan
    Karamouzas, Ioannis
    Walker, Ian
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5564 - 5571
  • [3] Modeling and Simulation of Passive Catheter based on Piecewise Constant Curvature Method
    Wang, Yuan
    Guo, Shuxiang
    Xiao, Nan
    2014 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (IEEE ICMA 2014), 2014, : 958 - 963
  • [4] Bayesian Learning for Policy Search in Trajectory Control of a Planar Manipulator
    Aghaei, Vahid Tavakol
    Agababaoglu, Arda
    Onat, Ahmet
    Yildirim, Sinan
    2019 IEEE 9TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2019, : 240 - 246
  • [5] Static modeling and analysis of soft manipulator considering environment contact based on segmented constant curvature method
    Chen, Yinglong
    Li, Wenshuo
    Gong, Yongjun
    INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2021, 48 (02): : 233 - 246
  • [6] Combining Soft-Actor Critic with Cross-Entropy Method for Policy Search in Continuous Control
    Hieu Trung Nguyen
    Khang Tran
    Ngoc Hoang Luong
    2022 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2022,