Self-Supervised 3D Representation Learning of Dressed Humans From Social Media Videos

被引:0
|
作者
Jafarian, Yasamin [1 ]
Park, Hyun Soo [1 ]
机构
[1] Univ Minnesota, Minneapolis, MN 55455 USA
关键词
Depth estimation; dataset; high fidelity human reconstruction; normal estimation; single view 3D reconstruction; self-supervised learning;
D O I
10.1109/TPAMI.2022.3231558
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A key challenge of learning a visual representation for the 3D high fidelity geometry of dressed humans lies in the limited availability of the ground truth data (e.g., 3D scanned models), which results in the performance degradation of 3D human reconstruction when applying to real-world imagery. We address this challenge by leveraging a new data resource: a number of social media dance videos that span diverse appearance, clothing styles, performances, and identities. Each video depicts dynamic movements of the body and clothes of a single person while lacking the 3D ground truth geometry. To learn a visual representation from these videos, we present a new self-supervised learning method to use the local transformation that warps the predicted local geometry of the person from an image to that of another image at a different time instant. This allows self-supervision by enforcing a temporal coherence over the predictions. In addition, we jointly learn the depths along with the surface normals that are highly responsive to local texture, wrinkle, and shade by maximizing their geometric consistency. Our method is end-to-end trainable, resulting in high fidelity depth estimation that predicts fine geometry faithful to the input real image. We further provide a theoretical bound of self-supervised learning via an uncertainty analysis that characterizes the performance of the self-supervised learning without training. We demonstrate that our method outperforms the state-of-the-art human depth estimation and human shape recovery approaches on both real and rendered images.
引用
收藏
页码:8969 / 8983
页数:15
相关论文
共 50 条
  • [41] Randomly shuffled convolution for self-supervised representation learning
    Oh, Youngjin
    Jeon, Minkyu
    Ko, Dohwan
    Kim, Hyunwoo J.
    INFORMATION SCIENCES, 2023, 623 : 206 - 219
  • [42] Self-supervised representation learning for surgical activity recognition
    Paysan, Daniel
    Haug, Luis
    Bajka, Michael
    Oelhafen, Markus
    Buhmann, Joachim M.
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2021, 16 (11) : 2037 - 2044
  • [43] AtmoDist: Self-supervised representation learning for atmospheric dynamics
    Hoffmann, Sebastian
    Lessig, Christian
    ENVIRONMENTAL DATA SCIENCE, 2023, 2
  • [44] Heuristic Attention Representation Learning for Self-Supervised Pretraining
    Van Nhiem Tran
    Liu, Shen-Hsuan
    Li, Yung-Hui
    Wang, Jia-Ching
    SENSORS, 2022, 22 (14)
  • [45] EgoFish3D: Egocentric 3D Pose Estimation From a Fisheye Camera via Self-Supervised Learning
    Liu, Yuxuan
    Yang, Jianxin
    Gu, Xiao
    Chen, Yijun
    Guo, Yao
    Yang, Guang-Zhong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 8880 - 8891
  • [46] Phonetically Motivated Self-Supervised Speech Representation Learning
    Yue, Xianghu
    Li, Haizhou
    INTERSPEECH 2021, 2021, : 746 - 750
  • [47] Random Field Augmentations for Self-Supervised Representation Learning
    Mansfield, Philip Andrew
    Afkanpour, Arash
    Morningstar, Warren Richard
    Singhal, Karan
    NEURIPS WORKSHOP ON SYMMETRY AND GEOMETRY IN NEURAL REPRESENTATIONS, 2023, 228 : 292 - 302
  • [48] Video Motion Perception for Self-supervised Representation Learning
    Li, Wei
    Luo, Dezhao
    Fang, Bo
    Li, Xiaoni
    Zhou, Yu
    Wang, Weiping
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT IV, 2022, 13532 : 508 - 520
  • [49] Self-supervised representation learning by predicting visual permutations
    Zhao, Qilu
    Dong, Junyu
    KNOWLEDGE-BASED SYSTEMS, 2020, 210
  • [50] Self-supervised Graph Representation Learning with Variational Inference
    Liao, Zihan
    Liang, Wenxin
    Liu, Han
    Mu, Jie
    Zhang, Xianchao
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT III, 2021, 12714 : 116 - 127