Legged Robots that Keep on Learning: Fine-Tuning Locomotion Policies in the Real World

被引：31

作者：

Smith, Laura ^{[1
]}

Kew, J. Chase ^{[2
]}

Peng, Xue Bin ^{[1
]}

Ha, Sehoon ^{[2
,3
]}

Tang, Jie ^{[2
]}

Levine, Sergey ^{[1
,2
]}

机构：

[1] Univ Calif Berkeley, Berkeley AI Res, Berkeley, CA 90095 USA

[2] Google Res, New York, NY USA

[3] Gcorgia Inst Technol, Atlanta, GA USA

来源：

2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022) | 2022年

关键词：

D O I：

10.1109/ICRA46639.2022.9812166

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Legged robots are physically capable of traversing a wide range of challenging environments, but designing controllers that are sufficiently robust to handle this diversity has been a long-standing challenge in robotics. Reinforcement learning presents an appealing approach for automating the controller design process and has been able to produce remarkably robust controllers when trained in a suitable range of environments. However, it is difficult to predict all likely conditions the robot will encounter during deployment and enumerate them at training-time. What if instead of training controllers that are robust enough to handle any eventuality, we enable the robot to continually learn in any setting it finds itself in? This kind of real-world reinforcement learning poses a number of challenges, including efficiency, safety, and autonomy. To address these challenges, we propose a practical robot reinforcement learning system for fine-tuning locomotion policies in the real world. We demonstrate that a modest amount of real-world training can substantially improve performance during deployment, and this enables a real A1 quadrupedal robot to autonomously fine-tune multiple locomotion skills in a range of environments, including an outdoor lawn and a variety of indoor terrains. (Videos and code1)

引用

页码：1593 / 1599

页数：7

共 50 条

[21] Editorial: Towards Real-World Deployment of Legged Robots
Kottege, Navinda
Sentis, Luis
Kanoulas, Dimitrios
FRONTIERS IN ROBOTICS AND AI, 2022, 8
[22] A Fine-Tuning Strategy Based on Real Scenes in Gait Identification
Zhang, Xianggang
Zeng, Jing
Wang, Guoyu
UBIQUITOUS SECURITY, 2022, 1557 : 336 - 350
[23] Mammalian microRNAs: a small world for fine-tuning gene expression
Sevignani, C
Calin, GA
Siracusa, LD
Croce, CM
MAMMALIAN GENOME, 2006, 17 (03) : 189 - 202
[24] Mammalian microRNAs: a small world for fine-tuning gene expression
Cinzia Sevignani
George A. Calin
Linda D. Siracusa
Carlo M. Croce
Mammalian Genome, 2006, 17 : 189 - 202
[25] Active Learning for Effectively Fine-Tuning Transfer Learning to Downstream Task
Abul Bashar, Md
Nayak, Richi
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2021, 12 (02)
[26] Residual Physics Learning and System Identification for Sim-to-real Transfer of Policies on Buoyancy Assisted Legged Robots
Sontakke, Nitish
Chae, Hosik
Lee, Sangjoon
Huang, Tianle
Hong, Dennis W.
Ha, Sehoon
2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 392 - 399
[27] Efficient Index Learning via Model Reuse and Fine-tuning
Liu, Guanli
Qi, Jianzhong
Kulik, Lars
Soga, Kazuya
Borovica-Gajic, Renata
Rubinstein, Benjamin I. P.
2023 IEEE 39TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS, ICDEW, 2023, : 60 - 66
[28] Fine-Tuning Network in Federated Learning for Personalized Skin Diagnosis
Lee, Kyungsu
Lee, Haeyun
Cavalcanti, Thiago Coutinho
Kim, Sewoong
El Fakhri, Georges
Lee, Dong Hun
Woo, Jonghye
Hwang, Jae Youn
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT III, 2023, 14222 : 378 - 388
[29] A statistical learning based approach for parameter fine-tuning of metaheuristics
Calvet, Laura
Juan, Angel A.
Serrat, Caries
Ries, Jana
SORT-STATISTICS AND OPERATIONS RESEARCH TRANSACTIONS, 2016, 40 (01) : 201 - 223
[30] Quantum device fine-tuning using unsupervised embedding learning
van Esbroeck, N. M.
Lennon, D. T.
Moon, H.
Nguyen, V
Vigneau, F.
Camenzind, L. C.
Yu, L.
Zumbuehl, D. M.
Briggs, G. A. D.
Sejdinovic, D.
Ares, N.
NEW JOURNAL OF PHYSICS, 2020, 22 (09):

← 1 2 3 4 5 →