Human-inspired Video Imitation Learning on Humanoid Model

被引：2

作者：

Lee, Chun Hei ^{[1
]}

Yueh, Nicole Chee Lin ^{[1
]}

Woo, Kam Tim ^{[1
]}

机构：

[1] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China

来源：

2022 SIXTH IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING, IRC | 2022年

关键词：

imitation learning; GAIL; locomotion; humanoid model;

D O I：

10.1109/IRC55401.2022.00068

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Generating good and human-like locomotion or other legged motions for bipedal robots has always been challenging. One of the emerging solutions to this challenge is to use imitation learning. The sources for imitation are mostly state-only demonstrations, so using state-of-the-art Generative Adversarial Imitation Learning (GAIL) with Imitation from Observation (IfO) ability will be an ideal frameworks to use in solving this problem. However, it is often difficult to allow new or complicated movements as the common sources for these frameworks are either expensive to set up or hard to produce satisfactory results without computationally expensive preprocessing, due to accuracy problems. Inspired by how people learn advanced knowledge after acquiring basic understandings of specific subjects, this paper proposes a Motion capture-aided Video Imitation (MoVI) learning framework based on Adversarial Motion Priors (AMP) by combining motion capture data of primary actions like walking with video clips of target motion like running, aiming to create smooth and natural imitation results of the target motion. This framework is able to produce various human-like locomotion by taking the most common and abundant motion capture data with any video clips of motion without the need for expensive datasets or sophisticated preprocessing.

引用

页码：345 / 352

页数：8

共 14 条

[1] Carnegie Mellon University Graphics Lab, 2020, CARN MELL U MOT CAPT
[2] RMPE: Regional Multi-Person Pose Estimation
Fang, Hao-Shu
Xie, Shuqin
Tai, Yu-Wing
Lu, Cewu
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2353 - 2362
[3] Heess N, 2017, Arxiv, DOI arXiv:1707.02286
[4] Ho J, 2016, ADV NEUR IN, V29
[5] CrowdPose: Efficient Crowded Scenes Pose Estimation and A New Benchmark
Li, Jiefeng
Wang, Can
Zhu, Hao
Mao, Yihuan
Fang, Hao-Shu
Lu, Cewu
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10855 - 10864
[6] Liu YX, 2018, IEEE INT CONF ROBOT, P1118
[7] Makoviichuk Denys, 2022, RL GAMES HIGH PERFOR
[8] Makoviychuk Viktor, 2021, Isaac gym: High performance gpu-based physics simulation for robot learning
[9] Least Squares Generative Adversarial Networks
Mao, Xudong
Li, Qing
Xie, Haoran
Lau, Raymond Y. K.
Wang, Zhen
Smolley, Stephen Paul
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2813 - 2821
[10] Merel J, 2017, Arxiv, DOI arXiv:1707.02201

← 1 2 →