Unsupervised 3D Human Pose Estimation in Multi-view-multi-pose Video

被引：0

作者：

Sun, Cheng ^{[1
]}

Thomas, Diego ^{[1
]}

Kawasaki, Hiroshi ^{[1
]}

机构：

[1] Kyushu Univ, Fukuoka, Japan

来源：

2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) | 2021年

关键词：

D O I：

10.1109/ICPR48806.2021.9412270

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

3D human pose estimation from a single 2D video is an extremely difficult task because computing 3D geometry from 2D images is an ill-posed problem. Recent popular solutions adopt fully-supervised learning strategy, which requires to train a deep network on a large-scale ground truth dataset of 3D poses and 2D images. However, such a large-scale dataset with natural images does not exist, which limits the usability of existing methods. While building a complete 3D dataset is tedious and expensive, abundant 2D in-the-wild data is already publicly available. As a consequence, there is a growing interest in the computer vision community to design efficient techniques that use the unsupervised learning strategy, which does not require any ground truth 3D data. Such methods can be trained with only natural 2D images of humans. In this paper we propose an unsupervised method for estimating 3D human pose in videos. The standard approach for unsupervised learning is to use the Generative Adversarial Network (GAN) framework. To improve the performance of 3D human pose estimation in videos, we propose a new GAN network that enforces body consistency over frames in a video. We evaluate the efficiency of our proposed method on a public 3D human body dataset.

引用

页码：5959 / 5964

页数：6

共 30 条

[1] [Anonymous], 2018, EUR C COMP VIS ECCV
[2] Arjovsky M, 2017, PR MACH LEARN RES, V70
[3] 3D Human Pose Estimation via Deep Learning from 2D annotations
Brau, Ernesto
Jiang, Hao
[J]. PROCEEDINGS OF 2016 FOURTH INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2016, : 582 - 591
[4] Unsupervised 3D Pose Estimation with Geometric Self-Supervision
Chen, Ching-Hang
Tyagi, Ambrish
Agrawal, Amit
Drover, Dylan
Rohith, M., V
Stojanov, Stefan
Rehg, James M.
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5707 - 5717
[5] Chen Chun-Jung, 2017, Collection and Research (Taichung), P1, DOI 10.6693/CAR201712_30(1).0001
[6] Dabral Rishabh, 2018, ECCV, P668
[7] Can 3D Pose Be Learned from 2D Projections Alone?
Drover, Dylan
Rohith, M., V
Chen, Ching-Hang
Agrawal, Amit
Tyagi, Ambrish
Cong Phuoc Huynh
[J]. COMPUTER VISION - ECCV 2018 WORKSHOPS, PT IV, 2019, 11132 : 78 - 94
[8] Goodfellow I., 2020, ADV NEUR IN, V63, P139, DOI [DOI 10.1145/3422622, 10.1145/3422622]
[9] Exploiting Temporal Information for 3D Human Pose Estimation
Hossain, Mir Rayat Imtiaz
Little, James J.
[J]. COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 69 - 86
[10] Learnable Triangulation of Human Pose
Iskakov, Karim
Burkov, Egor
Lempitsky, Victor
Malkov, Yury
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 7717 - 7726

← 1 2 3 →