Weakly-Supervised Reconstruction of 3D Objects with Large Shape Variation from Single In-the-Wild Images

被引:2
作者
Sun, Shichen [1 ]
Zhu, Zhengbang [1 ]
Dai, Xiaowei [1 ]
Zhao, Qijun [1 ,2 ]
Li, Jing [1 ]
机构
[1] Sichuan Univ, Coll Comp Sci, Chengdu, Peoples R China
[2] Tibet Univ, Sch Informat Sci & Technol, Lhasa, Peoples R China
来源
COMPUTER VISION - ACCV 2020, PT I | 2021年 / 12622卷
基金
中国国家自然科学基金;
关键词
D O I
10.1007/978-3-030-69525-5_1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing unsupervised 3D object reconstruction methods can not work well if the shape of objects varies substantially across images or if the images have distracting background. This paper proposes a novel learning framework for reconstructing 3D objects with large shape variation from single in-the-wild images. Considering that shape variation leads to appearance change of objects at various scales, we propose a fusion module to form combined multi-scale image features for 3D reconstruction. To deal with the ambiguity caused by shape variation, we propose side-output mask constraint to supervise the feature extraction, and adaptive edge constraint and initial shape constraint to supervise the shape reconstruction. Moreover, we propose background manipulation to augment the training images such that the obtained model is robust to background distraction. Extensive experiments have been done for both non-rigid objects (birds) and rigid objects (planes and vehicles), and the results prove the superiority of the proposed method.
引用
收藏
页码:3 / 19
页数:17
相关论文
共 42 条
[1]   A morphable model for the synthesis of 3D faces [J].
Blanz, V ;
Vetter, T .
SIGGRAPH 99 CONFERENCE PROCEEDINGS, 1999, :187-194
[2]   What Shape Are Dolphins? Building 3D Morphable Models from 2D Images [J].
Cashman, Thomas J. ;
Fitzgibbon, Andrew W. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (01) :232-244
[3]   Unsupervised 3D Reconstruction Networks [J].
Cha, Geonho ;
Lee, Minsik ;
Oh, Songhwai .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3848-3857
[4]   3D-R2N2: A Unified Approach for Single and Multi-view 3D Object Reconstruction [J].
Choy, Christopher B. ;
Xu, Danfei ;
Gwak, Jun Young ;
Chen, Kevin ;
Savarese, Silvio .
COMPUTER VISION - ECCV 2016, PT VIII, 2016, 9912 :628-644
[5]   A Point Set Generation Network for 3D Object Reconstruction from a Single Image [J].
Fan, Haoqiang ;
Su, Hao ;
Guibas, Leonidas .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2463-2471
[6]   Learning a Predictable and Generative Vector Representation for Objects [J].
Girdhar, Rohit ;
Fouhey, David F. ;
Rodriguez, Mikel ;
Gupta, Abhinav .
COMPUTER VISION - ECCV 2016, PT VI, 2016, 9910 :484-499
[7]   A Papier-Mache Approach to Learning 3D Surface Generation [J].
Groueix, Thibault ;
Fisher, Matthew ;
Kim, Vladimir G. ;
Russell, Bryan C. ;
Aubry, Mathieu .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :216-224
[8]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[9]  
Insafutdinov E, 2018, ADV NEUR IN, V31
[10]   Learning Category-Specific Mesh Reconstruction from Image Collections [J].
Kanazawa, Angjoo ;
Tulsiani, Shubham ;
Efros, Alexei A. ;
Malik, Jitendra .
COMPUTER VISION - ECCV 2018, PT 15, 2018, 11219 :386-402