Human-inspired Video Imitation Learning on Humanoid Model

被引:2
作者
Lee, Chun Hei [1 ]
Yueh, Nicole Chee Lin [1 ]
Woo, Kam Tim [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China
来源
2022 SIXTH IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING, IRC | 2022年
关键词
imitation learning; GAIL; locomotion; humanoid model;
D O I
10.1109/IRC55401.2022.00068
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Generating good and human-like locomotion or other legged motions for bipedal robots has always been challenging. One of the emerging solutions to this challenge is to use imitation learning. The sources for imitation are mostly state-only demonstrations, so using state-of-the-art Generative Adversarial Imitation Learning (GAIL) with Imitation from Observation (IfO) ability will be an ideal frameworks to use in solving this problem. However, it is often difficult to allow new or complicated movements as the common sources for these frameworks are either expensive to set up or hard to produce satisfactory results without computationally expensive preprocessing, due to accuracy problems. Inspired by how people learn advanced knowledge after acquiring basic understandings of specific subjects, this paper proposes a Motion capture-aided Video Imitation (MoVI) learning framework based on Adversarial Motion Priors (AMP) by combining motion capture data of primary actions like walking with video clips of target motion like running, aiming to create smooth and natural imitation results of the target motion. This framework is able to produce various human-like locomotion by taking the most common and abundant motion capture data with any video clips of motion without the need for expensive datasets or sophisticated preprocessing.
引用
收藏
页码:345 / 352
页数:8
相关论文
共 14 条
  • [1] Carnegie Mellon University Graphics Lab, 2020, CARN MELL U MOT CAPT
  • [2] RMPE: Regional Multi-Person Pose Estimation
    Fang, Hao-Shu
    Xie, Shuqin
    Tai, Yu-Wing
    Lu, Cewu
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2353 - 2362
  • [3] Heess N, 2017, Arxiv, DOI arXiv:1707.02286
  • [4] Ho J, 2016, ADV NEUR IN, V29
  • [5] CrowdPose: Efficient Crowded Scenes Pose Estimation and A New Benchmark
    Li, Jiefeng
    Wang, Can
    Zhu, Hao
    Mao, Yihuan
    Fang, Hao-Shu
    Lu, Cewu
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10855 - 10864
  • [6] Liu YX, 2018, IEEE INT CONF ROBOT, P1118
  • [7] Makoviichuk Denys, 2022, RL GAMES HIGH PERFOR
  • [8] Makoviychuk Viktor, 2021, Isaac gym: High performance gpu-based physics simulation for robot learning
  • [9] Least Squares Generative Adversarial Networks
    Mao, Xudong
    Li, Qing
    Xie, Haoran
    Lau, Raymond Y. K.
    Wang, Zhen
    Smolley, Stephen Paul
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2813 - 2821
  • [10] Merel J, 2017, Arxiv, DOI arXiv:1707.02201