Self-supervised Multi-view Stereo via View Synthesis Representation Consistency

被引:0
作者
Zhang, Hang [1 ]
Cao, Jie [2 ]
Wu, Xingming [1 ]
Liu, Zhong [1 ]
Chen, Weihai [1 ]
机构
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing, Peoples R China
[2] Zhejiang Mobile Informat Syst Integrat Co Ltd, Hangzhou, Peoples R China
来源
2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC | 2023年
基金
中国国家自然科学基金;
关键词
Multi-view Stereo; self-supervised; depth estimation; joint feature consistency; data joint representation augmentation;
D O I
10.1109/CCDC58219.2023.10327103
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Self-supervised Multi-view Stereo technique aiming at the reconstruction of 3D models from 2D images has made great progress. However, existing methods are mainly based on the premise that the corresponding pixels between different views have similar characteristics. However, in the actual scene, we often suffer from the interference of the occlusion area and non-Lambert surface, which reduces the quality of the depth map obtained by depth estimation, and also affects the accuracy and completeness of the final generated point cloud model. In this paper, we propose a new self-supervised framework that uses joint feature consistency and view synthesis representation consistency to construct the self-supervised signal. Additionally, we add data joint representation augmentation mechanism branch to capture the similarity degree of corresponding pixel points between images, so as to improve the unsatisfactory situation faced in the process of large-scale real depth acquisition. The results of experiments on DTU dataset show that our proposed method has good performance, even better than some supervised methods. Furthermore, the results of experiments on Tanks&Temples dataset prove that it also has good generalization ability.
引用
收藏
页码:876 / 881
页数:6
相关论文
共 28 条
[1]   Large-Scale Data for Multiple-View Stereopsis [J].
Aanaes, Henrik ;
Jensen, Rasmus Ramsbol ;
Vogiatzis, George ;
Tola, Engin ;
Dahl, Anders Bjorholm .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2016, 120 (02) :153-168
[2]   PatchMatch: A Randomized Correspondence Algorithm for Structural Image Editing [J].
Barnes, Connelly ;
Shechtman, Eli ;
Finkelstein, Adam ;
Goldman, Dan B. .
ACM TRANSACTIONS ON GRAPHICS, 2009, 28 (03)
[3]  
Chang D., 2022, ARXIV220303949
[4]   Point-Based Multi-View Stereo Network [J].
Chen, Rui ;
Han, Songfang ;
Xu, Jing ;
Su, Hao .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :1538-1547
[5]   Deep Stereo using Adaptive Thin Volume Representation with Uncertainty Awareness [J].
Cheng, Shuo ;
Xu, Zexiang ;
Zhu, Shilin ;
Li, Zhuwen ;
Li, Li Erran ;
Ramamoorthi, Ravi ;
Su, Hao .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :2521-2531
[6]   An Adaptive EKF-FMPC for the Trajectory Tracking of UVMS [J].
Dai, Yong ;
Yu, Shuanghe ;
Yan, Yan .
IEEE JOURNAL OF OCEANIC ENGINEERING, 2020, 45 (03) :699-713
[7]   KD-MVS: Knowledge Distillation Based Self-supervised Learning for Multi-view Stereo [J].
Ding, Yikang ;
Zhu, Qingtian ;
Liu, Xiangyue ;
Yuan, Wentao ;
Zhang, Haotian ;
Zhang, Chi .
COMPUTER VISION, ECCV 2022, PT XXXI, 2022, 13691 :630-646
[8]   Cascade Cost Volume for High-Resolution Multi-View Stereo and Stereo Matching [J].
Gu, Xiaodong ;
Fan, Zhiwen ;
Zhu, Siyu ;
Dai, Zuozhuo ;
Tan, Feitong ;
Tan, Ping .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :2492-2501
[9]   M3VSNET: UNSUPERVISED MULTI-METRIC MULTI-VIEW STEREO NETWORK [J].
Huang, Baichuan ;
Yi, Hongwei ;
Huang, Can ;
He, Yijia ;
Liu, Jingbin ;
Liu, Xiao .
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, :3163-3167
[10]   SurfaceNet: An End-to-end 3D Neural Network for Multiview Stereopsis [J].
Ji, Mengqi ;
Gall, Juergen ;
Zheng, Haitian ;
Liu, Yebin ;
Fang, Lu .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2326-2334