Self-supervised non-rigid structure from motion with improved training of Wasserstein GANs

被引:1
作者
Wang, Yaming [1 ,2 ]
Peng, Xiangyang [2 ]
Huang, Wenqing [2 ]
Ye, Xiaoping [1 ]
Jiang, Mingfeng [2 ]
机构
[1] Lishui Univ, Zhejiang Key Lab DDIMCCP, Lishui, Peoples R China
[2] Zhejiang Sci Tech Univ, Pattern Recognit & Comp Vis Lab, Hangzhou 310000, Peoples R China
基金
中国国家自然科学基金;
关键词
computer vision; neural nets;
D O I
10.1049/cvi2.12175
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This study proposes a self-supervised method to reconstruct 3D limbic structures from 2D landmarks extracted from a single view. The loss of self-consistency can be reduced by performing a random orthogonal projection of the reconstructed 3D structure. Thus, the training process can be self-supervised by using geometric self-consistency in the reconstruction-projection-reconstruction process. The self-supervised network mainly consists of graph convolution and Transformer encoders. This network is called the SS-Graphformer. By adding a discriminator, the SS-Graphformer is used as a generator to form a Wasserstein Generative Adversarial Network architecture with a Gradient Penalty to improve the accuracy of the reconstruction. It is experimentally demonstrated that the addition of the 2D structure discriminator can significantly improve the accuracy of the reconstruction.
引用
收藏
页码:404 / 414
页数:11
相关论文
共 33 条
[1]  
Akhter I., 2008, Adv. Neural Inform. Process. Syst. (NIPS), P41
[2]   Trajectory Space: A Dual Representation for Nonrigid Structure from Motion [J].
Akhter, Ijaz ;
Sheikh, Yaser ;
Khan, Sohaib ;
Kanade, Takeo .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (07) :1442-1456
[3]  
Akhter I, 2009, PROC CVPR IEEE, P1534, DOI 10.1109/CVPRW.2009.5206620
[4]  
Bozic A, 2021, ADV NEUR IN
[5]  
Bregler C, 2000, PROC CVPR IEEE, P690, DOI 10.1109/CVPR.2000.854941
[6]   Unsupervised 3D Pose Estimation with Geometric Self-Supervision [J].
Chen, Ching-Hang ;
Tyagi, Ambrish ;
Agrawal, Amit ;
Drover, Dylan ;
Rohith, M., V ;
Stojanov, Stefan ;
Rehg, James M. .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5707-5717
[7]  
Chen YH, 2017, AIP CONF PROC, V1812, DOI [10.1063/1.4975898, 10.1109/ICCV.2017.137]
[8]   A Simple Prior-Free Method for Non-rigid Structure-from-Motion Factorization [J].
Dai, Yuchao ;
Li, Hongdong ;
He, Mingyi .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2014, 107 (02) :101-122
[9]  
Deng H., 2022, arXiv
[10]  
Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929