Self-supervised non-rigid structure from motion with improved training of Wasserstein GANs

被引:1
作者
Wang, Yaming [1 ,2 ]
Peng, Xiangyang [2 ]
Huang, Wenqing [2 ]
Ye, Xiaoping [1 ]
Jiang, Mingfeng [2 ]
机构
[1] Lishui Univ, Zhejiang Key Lab DDIMCCP, Lishui, Peoples R China
[2] Zhejiang Sci Tech Univ, Pattern Recognit & Comp Vis Lab, Hangzhou 310000, Peoples R China
基金
中国国家自然科学基金;
关键词
computer vision; neural nets;
D O I
10.1049/cvi2.12175
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This study proposes a self-supervised method to reconstruct 3D limbic structures from 2D landmarks extracted from a single view. The loss of self-consistency can be reduced by performing a random orthogonal projection of the reconstructed 3D structure. Thus, the training process can be self-supervised by using geometric self-consistency in the reconstruction-projection-reconstruction process. The self-supervised network mainly consists of graph convolution and Transformer encoders. This network is called the SS-Graphformer. By adding a discriminator, the SS-Graphformer is used as a generator to form a Wasserstein Generative Adversarial Network architecture with a Gradient Penalty to improve the accuracy of the reconstruction. It is experimentally demonstrated that the addition of the 2D structure discriminator can significantly improve the accuracy of the reconstruction.
引用
收藏
页码:404 / 414
页数:11
相关论文
共 33 条
[21]   C3DPO: Canonical 3D Pose Networks for Non-Rigid Structure From Motion [J].
Novotny, David ;
Ravi, Nikhila ;
Graham, Benjamin ;
Neverova, Natalia ;
Vedaldi, Andrea .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :7687-7696
[22]   Procrustean Regression Networks: Learning 3D Structure of Non-rigid Objects from 2D Annotations [J].
Park, Sungheon ;
Lee, Minsik ;
Kwak, Nojun .
COMPUTER VISION - ECCV 2020, PT XXIX, 2020, 12374 :1-18
[23]   Procrustean Regression: A Flexible Alignment-Based Framework for Nonrigid Structure Estimation [J].
Park, Sungheon ;
Lee, Minsik ;
Kwak, Nojun .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (01) :249-264
[24]   Neural Dense Non-Rigid Structure from Motion with Latent Space Constraints [J].
Sidhu, Vikramjit ;
Tretschk, Edgar ;
Golyanik, Vladislav ;
Agudo, Antonio ;
Theobalt, Christian .
COMPUTER VISION - ECCV 2020, PT XVI, 2020, 12361 :204-222
[25]   Adversarial Inverse Graphics Networks: Learning 2D-to-3D Lifting and Image-to-Image Translation from Unpaired Supervision [J].
Tung, Hsiao-Yu Fish ;
Harley, Adam W. ;
Seto, William ;
Fragkiadaki, Katerina .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :4364-4372
[26]  
Vaswani A, 2017, ADV NEUR IN, V30
[27]   RepNet: Weakly Supervised Training of an Adversarial Reprojection Network for 3D Human Pose Estimation [J].
Wandt, Bastian ;
Rosenhahn, Bodo .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :7774-7783
[28]   PAUL: Procrustean Autoencoder for Unsupervised Lifting [J].
Wang, Chaoyang ;
Lucey, Simon .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :434-443
[29]   Deep NRSfM plus plus : Towards Unsupervised 2D-3D Lifting in theWild [J].
Wang, Chaoyang ;
Lin, Chen-Hsuan ;
Lucey, Simon .
2020 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2020), 2020, :12-22
[30]  
Yan SJ, 2018, AAAI CONF ARTIF INTE, P7444