Meta Reinforcement Learning for Optimal Design of Legged Robots

Cited by: 13

Authors:

Belmonte-Baeza, Alvaro [1]

Lee, Joonho [1]

Valsecchi, Giorgio [1]

Hutter, Marco [1]

Affiliations:

[1] Swiss Fed Inst Technol, Robot Syst Lab RSL, CH-8092 Zurich, Switzerland

Source:

IEEE ROBOTICS AND AUTOMATION LETTERS | 2022 / Vol. 7 / No. 4

Funding:

European Research Council;

Keywords:

Reinforcement Learning; Mechanism Design; Legged Robots;

DOI:

10.1109/LRA.2022.3211785

Chinese Library Classification (CLC) number:

TP24 [Robotics];

Discipline classification code:

080202 ; 1405 ;

Abstract:

Robot design is a complex task, and the majority of design decisions are still based on human intuition or tedious manual tuning. A more informed approach is to use computational design methods, in which design parameters are optimized concurrently with their corresponding controllers. Existing approaches, however, are strongly influenced by predefined control rules or motion templates and cannot provide end-to-end solutions. In this paper, we present a design optimization framework using model-free meta reinforcement learning, and its application to optimizing the kinematics and actuator parameters of quadrupedal robots. We use meta reinforcement learning to train a locomotion policy that can quickly adapt to different designs. This policy is used to evaluate each design instance during the design optimization. We demonstrate that the policy can control robots of different designs to track random velocity commands over various rough terrains. With controlled experiments, we show that the meta policy achieves close-to-optimal performance for each design instance after adaptation. Lastly, we compare our results against a model-based baseline and show that our approach achieves higher performance while not being constrained by predefined motions or gait patterns.


Pages: 12134-12141

Page count: 8
