Back-Stepping Experience Replay With Application to Model-Free Reinforcement Learning for a Soft Snake Robot

被引：1

作者：

Qi, Xinda ^{[1
]}

Chen, Dong ^{[1
]}

Li, Zhaojian ^{[2
]}

Tan, Xiaobo ^{[1
]}

机构：

[1] Michigan State Univ, Dept Elect & Comp Engn, E Lansing, MI 48824 USA

[2] Michigan State Univ, Dept Mech Engn, E Lansing, MI 48824 USA

来源：

IEEE ROBOTICS AND AUTOMATION LETTERS | 2024年 / 9卷 / 09期

基金：

美国国家科学基金会;

关键词：

Standards; Approximation algorithms; Snake robots; Task analysis; Trajectory; Training; Navigation; Deep reinforcement learning; experience replay; soft robot; snake robot; locomotion; navigation;

D O I：

10.1109/LRA.2024.3427550

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

In this letter, we propose a novel technique, Back-stepping Experience Replay (BER), that is compatible with arbitrary off-policy reinforcement learning (RL) algorithms. BER aims to enhance learning efficiency in systems with approximate reversibility, reducing the need for complex reward shaping. The method constructs reversed trajectories using back-stepping transitions to reach random or fixed targets. Interpretable as a bi-directional approach, BER addresses inaccuracies in back-stepping transitions through a purification of the replay experience during learning. Given the intricate nature of soft robots and their complex interactions with environments, we present an application of BER in a model-free RL approach for the locomotion and navigation of a soft snake robot, which is capable of serpentine motion enabled by anisotropic friction between the body and ground. In addition, a dynamic simulator is developed to assess the effectiveness and efficiency of the BER algorithm, in which the robot demonstrates successful learning (reaching a 100% success rate) and adeptly reaches random targets, achieving an average speed 48% faster than that of the best baseline approach.

引用

页码：7517 / 7524

页数：8

共 24 条

[1] Andrychowicz M., 2017, P ADV NEUR INF PROC, V30
[2] [Anonymous], 2017, Int. J. Control, Automat. Syst., V15, P3
[3] Badia Adria Puigdomenech, 2020, INT C LEARN REPR
[4] Ding YM, 2019, ADV NEUR IN, V32
[5] Fang M., 2019, P INT C LEARN REPR
[6] Fox L., 2018, P INT C LEARN REPR
[7] Hu D., 2012, NATURAL LOCOMOTION F, P117, DOI DOI 10.1007/978-1-4614-3997-4-8
[8] The mechanics of slithering locomotion
Hu, David L.
Nirody, Jasmine
Scott, Terri
Shelley, Michael J.
[J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2009, 106 (25) : 10081 - 10085
[9] Hua M, 2024, Arxiv, DOI arXiv:2312.11084
[10] Jitosho T. G. W., 2023, C ROBOT LEARNING, P1131

← 1 2 3 →