Back-Stepping Experience Replay With Application to Model-Free Reinforcement Learning for a Soft Snake Robot

被引:1
作者
Qi, Xinda [1 ]
Chen, Dong [1 ]
Li, Zhaojian [2 ]
Tan, Xiaobo [1 ]
机构
[1] Michigan State Univ, Dept Elect & Comp Engn, E Lansing, MI 48824 USA
[2] Michigan State Univ, Dept Mech Engn, E Lansing, MI 48824 USA
来源
IEEE ROBOTICS AND AUTOMATION LETTERS | 2024年 / 9卷 / 09期
基金
美国国家科学基金会;
关键词
Standards; Approximation algorithms; Snake robots; Task analysis; Trajectory; Training; Navigation; Deep reinforcement learning; experience replay; soft robot; snake robot; locomotion; navigation;
D O I
10.1109/LRA.2024.3427550
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In this letter, we propose a novel technique, Back-stepping Experience Replay (BER), that is compatible with arbitrary off-policy reinforcement learning (RL) algorithms. BER aims to enhance learning efficiency in systems with approximate reversibility, reducing the need for complex reward shaping. The method constructs reversed trajectories using back-stepping transitions to reach random or fixed targets. Interpretable as a bi-directional approach, BER addresses inaccuracies in back-stepping transitions through a purification of the replay experience during learning. Given the intricate nature of soft robots and their complex interactions with environments, we present an application of BER in a model-free RL approach for the locomotion and navigation of a soft snake robot, which is capable of serpentine motion enabled by anisotropic friction between the body and ground. In addition, a dynamic simulator is developed to assess the effectiveness and efficiency of the BER algorithm, in which the robot demonstrates successful learning (reaching a 100% success rate) and adeptly reaches random targets, achieving an average speed 48% faster than that of the best baseline approach.
引用
收藏
页码:7517 / 7524
页数:8
相关论文
共 24 条
  • [1] Andrychowicz M., 2017, P ADV NEUR INF PROC, V30
  • [2] [Anonymous], 2017, Int. J. Control, Automat. Syst., V15, P3
  • [3] Badia Adria Puigdomenech, 2020, INT C LEARN REPR
  • [4] Ding YM, 2019, ADV NEUR IN, V32
  • [5] Fang M., 2019, P INT C LEARN REPR
  • [6] Fox L., 2018, P INT C LEARN REPR
  • [7] Hu D., 2012, NATURAL LOCOMOTION F, P117, DOI DOI 10.1007/978-1-4614-3997-4-8
  • [8] The mechanics of slithering locomotion
    Hu, David L.
    Nirody, Jasmine
    Scott, Terri
    Shelley, Michael J.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2009, 106 (25) : 10081 - 10085
  • [9] Hua M, 2024, Arxiv, DOI arXiv:2312.11084
  • [10] Jitosho T. G. W., 2023, C ROBOT LEARNING, P1131