Multi-Modal Legged Locomotion Framework With Automated Residual Reinforcement Learning

被引:11
作者
Yu, Chen [1 ]
Rosendo, Andre [1 ]
机构
[1] ShanghaiTech Univ, Sch Informat Sci & Technol, Shanghai 201210, Peoples R China
关键词
Evolutionary robotics; legged robots; multi-modal locomotion; reinforcement learning; HUMANOID ROBOTS; OPTIMIZATION; WALKING; DESIGN;
D O I
10.1109/LRA.2022.3191071
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
While quadruped robots usually have good stability and load capacity, bipedal robots offer a higher level of flexibility / adaptability to different tasks and environments. A multi-modal legged robot can take the best of both worlds. In this paper, we propose a multi-modal locomotion framework that is composed of a hand-crafted transition motion and a learning-based bipedal controller-learnt by a novel algorithm called Automated Residual Reinforcement Learning. This framework aims to endow arbitrary quadruped robots with the ability to walk bipedally. In particular, we 1) design an additional supporting structure for a quadruped robot and a sequential multi-modal transition strategy; 2) propose a novel class of Reinforcement Learning algorithms for bipedal control and evaluate their performances in both simulation and the real world. Experimental results show that our proposed algorithms have the best performance in simulation and maintain a good performance in a real-world robot. Overall, our multi-modal robot could successfully switch between biped and quadruped, and walk in both modes.
引用
收藏
页码:10312 / 10319
页数:8
相关论文
共 50 条
  • [1] MMDP: A Mobile-IoT Based Multi-Modal Reinforcement Learning Service Framework
    Wang, Puming
    Yang, Laurence T.
    Li, Jintao
    Li, Xue
    Zhou, Xiaokang
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2020, 13 (04) : 675 - 684
  • [2] Bayesian decomposition of multi-modal dynamical systems for reinforcement learning
    Kaiser, Markus
    Otte, Clemens
    Runkler, Thomas A.
    Ek, Carl Henrik
    NEUROCOMPUTING, 2020, 416 (352-359) : 352 - 359
  • [3] Learning to Climb: Constrained Contextual Bayesian Optimisation on a Multi-Modal Legged Robot
    Yu, Chen
    Cao, Jinyue
    Rosendo, Andre
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) : 9881 - 9888
  • [4] A general locomotion control framework for multi-legged locomotors
    Chong, Baxi
    Aydin, Yasemin O.
    Rieser, Jennifer M.
    Sartoretti, Guillaume
    Wang, Tianyu
    Whitman, Julian
    Kaba, Abdul
    Aydin, Enes
    McFarland, Ciera
    Cruz, Kelimar Diaz
    Rankin, Jeffery W.
    Michel, Krijn B.
    Nicieza, Alfredo
    Hutchinson, John R.
    Choset, Howie
    Goldman, Daniel, I
    BIOINSPIRATION & BIOMIMETICS, 2022, 17 (04)
  • [5] Perspectives on biologically inspired hybrid and multi-modal locomotion PREFACE
    Low, K. H.
    Hu, Tianjiang
    Mohammed, Samer
    Tangorra, James
    Kovac, Mirko
    BIOINSPIRATION & BIOMIMETICS, 2015, 10 (02)
  • [6] CTS: Concurrent Teacher-Student Reinforcement Learning for Legged Locomotion
    Wang, Hongxi
    Luo, Haoxiang
    Zhang, Wei
    Chen, Hua
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (11): : 9191 - 9198
  • [7] Multi-agent Deep Reinforcement Learning for Multi-modal Orienteering Problem
    Liu, Wei
    Li, Kaiwen
    Li, Wenhua
    Wang, Rui
    Zhang, Tao
    18TH INTERNATIONAL SYMPOSIUM ON APPLIED COMPUTATIONAL INTELLIGENCE AND INFORMATICS, SACI 2024, 2024, : 169 - 174
  • [8] Tunable Multi-Modal Locomotion in Soft Dielectric Elastomer Robots
    Duduta, Mihai
    Berlinger, Florian
    Nagpal, Radhika
    Clarke, David R.
    Wood, Robert J.
    Temel, F. Zeynep
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (03) : 3868 - 3875
  • [9] Deep reinforcement learning for financial trading using multi-modal features
    Avramelou, Loukia
    Nousi, Paraskevi
    Passalis, Nikolaos
    Tefas, Anastasios
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
  • [10] Exploring unknown environments with multi-modal locomotion swarm
    Ouarda, Zedadra
    Nicolas, Jouandeau
    Hamid, Seridi
    Giancarlo, Fortino
    INTELLIGENT DISTRIBUTED COMPUTING X, 2017, 678 : 131 - 140