Unsupervised 3D Human Pose Estimation in Multi-view-multi-pose Video

Authors
Sun, Cheng [1]
Thomas, Diego [1]
Kawasaki, Hiroshi [1]
Affiliations
[1] Kyushu Univ, Fukuoka, Japan
DOI
10.1109/ICPR48806.2021.9412270
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
3D human pose estimation from a single 2D video is extremely difficult because recovering 3D geometry from 2D images is an ill-posed problem. Recent popular solutions adopt a fully supervised learning strategy, which requires training a deep network on a large-scale ground-truth dataset of 3D poses and 2D images. However, no such large-scale dataset of natural images exists, which limits the usability of existing methods. While building a complete 3D dataset is tedious and expensive, abundant in-the-wild 2D data is already publicly available. Consequently, there is growing interest in the computer vision community in designing efficient techniques based on unsupervised learning, which requires no ground-truth 3D data; such methods can be trained with only natural 2D images of humans. In this paper, we propose an unsupervised method for estimating 3D human pose in videos. The standard approach to unsupervised learning is to use the Generative Adversarial Network (GAN) framework. To improve the performance of 3D human pose estimation in videos, we propose a new GAN that enforces body consistency across the frames of a video. We evaluate the efficiency of our proposed method on a public 3D human body dataset.
Pages: 5959-5964
Number of pages: 6