Meta-Learning for Fast Adaptive Locomotion with Uncertainties in Environments and Robot Dynamics

被引:7
作者
Anne, Timothee [1 ]
Wilkinson, Jack [2 ]
Li, Zhibin [2 ]
机构
[1] ENS Rennes, Rennes, France
[2] Univ Edinburgh, Sch Informat, Edinburgh, Midlothian, Scotland
来源
2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2021年
基金
英国工程与自然科学研究理事会;
关键词
BAYESIAN OPTIMIZATION;
D O I
10.1109/IROS51168.2021.9635840
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This work developed meta-learning control policies to achieve fast online adaptation to different changing conditions, which generate diverse and robust locomotion. The proposed method updates the interaction model constantly, samples feasible sequences of actions of estimated state-action trajectories, and then applies the optimal actions to maximize the reward. To achieve online model adaptation, our proposed method learns different latent vectors of each training condition, which is selected online based on newly collected data from the past 10 samples within 0.2s. Our work designs appropriate state space and reward functions, and optimizes feasible actions in an MPC fashion which are sampled directly in the joint space with constraints, hence requiring no prior design or training of specific gaits. We further demonstrated the robot's capability of detecting unexpected changes during the interaction and adapting the control policy in less than 0.2s. The extensive validation on the SpotMicro robot in a physics simulation shows adaptive and robust locomotion skills under changing ground friction, external pushes, and different robot dynamics including motor failures and the whole leg amputation.
引用
收藏
页码:4568 / 4575
页数:8
相关论文
共 29 条
  • [1] Brochu E., 2010, ARXIV10122599
  • [2] Contact-Implicit Trajectory Optimization Using an Analytically Solvable Contact Model for Locomotion on Variable Ground
    Chatzinikolaidis, Iordanis
    You, Yangwei
    Li, Zhibin
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (04) : 6357 - 6364
  • [3] Chua K, 2018, ADV NEUR IN, V31
  • [4] Dallali H., CYBERNETICS INFORM T, V12, P39
  • [5] Deisenroth M., 2011, P INT C MACH LEARN I, P465
  • [6] Fankhauser P., 2018, 2018 IEEE International Conference on Robotics and Automation ICRA, P1
  • [7] Finn C, 2017, PR MACH LEARN RES, V70
  • [8] Forestier S., CORR
  • [9] Gal Y, 2016, PR MACH LEARN RES, V48
  • [10] Ha D, 2018, ADV NEUR IN, V31