Legged Robots that Keep on Learning: Fine-Tuning Locomotion Policies in the Real World

被引:31
|
作者
Smith, Laura [1 ]
Kew, J. Chase [2 ]
Peng, Xue Bin [1 ]
Ha, Sehoon [2 ,3 ]
Tang, Jie [2 ]
Levine, Sergey [1 ,2 ]
机构
[1] Univ Calif Berkeley, Berkeley AI Res, Berkeley, CA 90095 USA
[2] Google Res, New York, NY USA
[3] Gcorgia Inst Technol, Atlanta, GA USA
来源
2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022) | 2022年
关键词
D O I
10.1109/ICRA46639.2022.9812166
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Legged robots are physically capable of traversing a wide range of challenging environments, but designing controllers that are sufficiently robust to handle this diversity has been a long-standing challenge in robotics. Reinforcement learning presents an appealing approach for automating the controller design process and has been able to produce remarkably robust controllers when trained in a suitable range of environments. However, it is difficult to predict all likely conditions the robot will encounter during deployment and enumerate them at training-time. What if instead of training controllers that are robust enough to handle any eventuality, we enable the robot to continually learn in any setting it finds itself in? This kind of real-world reinforcement learning poses a number of challenges, including efficiency, safety, and autonomy. To address these challenges, we propose a practical robot reinforcement learning system for fine-tuning locomotion policies in the real world. We demonstrate that a modest amount of real-world training can substantially improve performance during deployment, and this enables a real A1 quadrupedal robot to autonomously fine-tune multiple locomotion skills in a range of environments, including an outdoor lawn and a variety of indoor terrains. (Videos and code1)
引用
收藏
页码:1593 / 1599
页数:7
相关论文
共 50 条
  • [21] Editorial: Towards Real-World Deployment of Legged Robots
    Kottege, Navinda
    Sentis, Luis
    Kanoulas, Dimitrios
    FRONTIERS IN ROBOTICS AND AI, 2022, 8
  • [22] A Fine-Tuning Strategy Based on Real Scenes in Gait Identification
    Zhang, Xianggang
    Zeng, Jing
    Wang, Guoyu
    UBIQUITOUS SECURITY, 2022, 1557 : 336 - 350
  • [23] Mammalian microRNAs: a small world for fine-tuning gene expression
    Sevignani, C
    Calin, GA
    Siracusa, LD
    Croce, CM
    MAMMALIAN GENOME, 2006, 17 (03) : 189 - 202
  • [24] Mammalian microRNAs: a small world for fine-tuning gene expression
    Cinzia Sevignani
    George A. Calin
    Linda D. Siracusa
    Carlo M. Croce
    Mammalian Genome, 2006, 17 : 189 - 202
  • [25] Active Learning for Effectively Fine-Tuning Transfer Learning to Downstream Task
    Abul Bashar, Md
    Nayak, Richi
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2021, 12 (02)
  • [26] Residual Physics Learning and System Identification for Sim-to-real Transfer of Policies on Buoyancy Assisted Legged Robots
    Sontakke, Nitish
    Chae, Hosik
    Lee, Sangjoon
    Huang, Tianle
    Hong, Dennis W.
    Ha, Sehoon
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 392 - 399
  • [27] Efficient Index Learning via Model Reuse and Fine-tuning
    Liu, Guanli
    Qi, Jianzhong
    Kulik, Lars
    Soga, Kazuya
    Borovica-Gajic, Renata
    Rubinstein, Benjamin I. P.
    2023 IEEE 39TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS, ICDEW, 2023, : 60 - 66
  • [28] Fine-Tuning Network in Federated Learning for Personalized Skin Diagnosis
    Lee, Kyungsu
    Lee, Haeyun
    Cavalcanti, Thiago Coutinho
    Kim, Sewoong
    El Fakhri, Georges
    Lee, Dong Hun
    Woo, Jonghye
    Hwang, Jae Youn
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT III, 2023, 14222 : 378 - 388
  • [29] A statistical learning based approach for parameter fine-tuning of metaheuristics
    Calvet, Laura
    Juan, Angel A.
    Serrat, Caries
    Ries, Jana
    SORT-STATISTICS AND OPERATIONS RESEARCH TRANSACTIONS, 2016, 40 (01) : 201 - 223
  • [30] Quantum device fine-tuning using unsupervised embedding learning
    van Esbroeck, N. M.
    Lennon, D. T.
    Moon, H.
    Nguyen, V
    Vigneau, F.
    Camenzind, L. C.
    Yu, L.
    Zumbuehl, D. M.
    Briggs, G. A. D.
    Sejdinovic, D.
    Ares, N.
    NEW JOURNAL OF PHYSICS, 2020, 22 (09):