Unsupervised 3D Human Pose Estimation in Multi-view-multi-pose Video

被引:0
作者
Sun, Cheng [1 ]
Thomas, Diego [1 ]
Kawasaki, Hiroshi [1 ]
机构
[1] Kyushu Univ, Fukuoka, Japan
来源
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) | 2021年
关键词
D O I
10.1109/ICPR48806.2021.9412270
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D human pose estimation from a single 2D video is an extremely difficult task because computing 3D geometry from 2D images is an ill-posed problem. Recent popular solutions adopt fully-supervised learning strategy, which requires to train a deep network on a large-scale ground truth dataset of 3D poses and 2D images. However, such a large-scale dataset with natural images does not exist, which limits the usability of existing methods. While building a complete 3D dataset is tedious and expensive, abundant 2D in-the-wild data is already publicly available. As a consequence, there is a growing interest in the computer vision community to design efficient techniques that use the unsupervised learning strategy, which does not require any ground truth 3D data. Such methods can be trained with only natural 2D images of humans. In this paper we propose an unsupervised method for estimating 3D human pose in videos. The standard approach for unsupervised learning is to use the Generative Adversarial Network (GAN) framework. To improve the performance of 3D human pose estimation in videos, we propose a new GAN network that enforces body consistency over frames in a video. We evaluate the efficiency of our proposed method on a public 3D human body dataset.
引用
收藏
页码:5959 / 5964
页数:6
相关论文
共 30 条
  • [1] [Anonymous], 2018, EUR C COMP VIS ECCV
  • [2] Arjovsky M, 2017, PR MACH LEARN RES, V70
  • [3] 3D Human Pose Estimation via Deep Learning from 2D annotations
    Brau, Ernesto
    Jiang, Hao
    [J]. PROCEEDINGS OF 2016 FOURTH INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2016, : 582 - 591
  • [4] Unsupervised 3D Pose Estimation with Geometric Self-Supervision
    Chen, Ching-Hang
    Tyagi, Ambrish
    Agrawal, Amit
    Drover, Dylan
    Rohith, M., V
    Stojanov, Stefan
    Rehg, James M.
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5707 - 5717
  • [5] Chen Chun-Jung, 2017, Collection and Research (Taichung), P1, DOI 10.6693/CAR201712_30(1).0001
  • [6] Dabral Rishabh, 2018, ECCV, P668
  • [7] Can 3D Pose Be Learned from 2D Projections Alone?
    Drover, Dylan
    Rohith, M., V
    Chen, Ching-Hang
    Agrawal, Amit
    Tyagi, Ambrish
    Cong Phuoc Huynh
    [J]. COMPUTER VISION - ECCV 2018 WORKSHOPS, PT IV, 2019, 11132 : 78 - 94
  • [8] Goodfellow I., 2020, ADV NEUR IN, V63, P139, DOI [DOI 10.1145/3422622, 10.1145/3422622]
  • [9] Exploiting Temporal Information for 3D Human Pose Estimation
    Hossain, Mir Rayat Imtiaz
    Little, James J.
    [J]. COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 69 - 86
  • [10] Learnable Triangulation of Human Pose
    Iskakov, Karim
    Burkov, Egor
    Lempitsky, Victor
    Malkov, Yury
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 7717 - 7726