Ssman: self-supervised masked adaptive network for 3D human pose estimation

被引:1
|
作者
Shi, Yu [1 ]
Yue, Tianyi [1 ]
Zhao, Hu [1 ]
He, Guoping [1 ]
Ren, Keyan [1 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, 100 Pingleyuan, Beijing 100124, Peoples R China
关键词
Deep learning; Human pose estimation; Adaption ability; Self-supervised learning;
D O I
10.1007/s00138-024-01514-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The modern deep learning-based models for 3D human pose estimation from monocular images always lack the adaption ability between occlusion and non-occlusion scenarios, which might restrict the performance of current methods when faced with various scales of occluded conditions. In an attempt to tackle this problem, we propose a novel network called self-supervised masked adaptive network (SSMAN). Firstly, we leverage different levels of masks to cover the richness of occlusion in fully in-the-wild environment. Then, we design a multi-line adaptive network, which could be trained with various scales of masked images in parallel. Based on this masked adaptive network, we train it with self-supervised learning to enforce the consistency across the outputs under different mask ratios. Furthermore, a global refinement module is proposed to leverage global features of the human body to refine the human pose estimated solely by local features. We perform extensive experiments both on the occlusion datasets like 3DPW-OCC and OCHuman and general datasets such as Human3.6M and 3DPW. The results show that SSMAN achieves new state-of-the-art performance on both lightly and heavily occluded benchmarks and is highly competitive with significant improvement on standard benchmarks.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Ssman: self-supervised masked adaptive network for 3D human pose estimation
    Yu Shi
    Tianyi Yue
    Hu Zhao
    Guoping He
    Keyan Ren
    Machine Vision and Applications, 2024, 35
  • [2] Self-supervised 3D human pose estimation from video
    Gholami, Mohsen
    Rezaei, Ahmad
    Rhodin, Helge
    Ward, Rabab
    Wang, Z. Jane
    NEUROCOMPUTING, 2022, 488 : 97 - 106
  • [3] Rotated Orthographic Projection for Self-supervised 3D Human Pose Estimation
    Yao, Yao
    Pan, Yixuan
    Shi, Wenjun
    Zhu, Dongchen
    Wang, Lei
    Li, Jiamao
    COMPUTER VISION - ECCV 2024, PT LXIX, 2025, 15127 : 422 - 439
  • [4] CanonPose: Self-Supervised Monocular 3D Human Pose Estimation in the Wild
    Wandt, Bastian
    Rudolph, Marco
    Zell, Petrissa
    Rhodin, Helge
    Rosenhahn, Bodo
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 13289 - 13299
  • [5] 3D Human Pose Machines with Self-Supervised Learning
    Wang, Keze
    Lin, Liang
    Jiang, Chenhan
    Qian, Chen
    Wei, Pengxu
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (05) : 1069 - 1082
  • [6] Multi-View 3D Human Pose Estimation with Self-Supervised Learning
    Chang, Inho
    Park, Min-Gyu
    Kim, Jaewoo
    Yoon, Ju Hong
    3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION (IEEE ICAIIC 2021), 2021, : 255 - 257
  • [7] Self-Supervised 3D Human Pose Estimation with Multiple-View Geometry
    Bouazizi, Arij
    Wiederer, Julian
    Kressel, Ulrich
    Belagiannis, Vasileios
    2021 16TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2021), 2021,
  • [8] Geometry-Driven Self-Supervised Method for 3D Human Pose Estimation
    Li, Yang
    Li, kan
    Jiang, Shuai
    Zhang, Ziyue
    Huang, Congzhentao
    Xu, Richard Yi Da
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11442 - 11449
  • [9] Uncertainty-Aware Adaptation for Self-Supervised 3D Human Pose Estimation
    Kundu, Jogendra Nath
    Seth, Siddharth
    Pradyumna, Y. M.
    Jampani, Varun
    Chakraborty, Anirban
    Babu, R. Venkatesh
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 20416 - 20427
  • [10] Self-supervised method for 3D human pose estimation with consistent shape and viewpoint factorization
    Zhichao Ma
    Kan Li
    Yang Li
    Applied Intelligence, 2023, 53 : 3864 - 3876