Semisupervised learning-based depth estimation with semantic inference guidance

被引:0
作者
Yan Zhang
XiaoPeng Fan
DeBin Zhao
机构
[1] Harbin Institute of Technology,Department of Computer Science and Technology
来源
Science China Technological Sciences | 2022年 / 65卷
关键词
depth estimation; semisupervised learning; semantic information; neural networks;
D O I
暂无
中图分类号
学科分类号
摘要
Depth estimation is a fundamental computer vision problem that infers three-dimensional (3D) structures from a given scene. As it is an ill-posed problem, to fit the projection function from the given scene to the 3D structure, traditional methods generally require mass amounts of annotated data. Such pixel-level annotation is quite labor consuming, especially when addressing reflective surfaces such as mirrors or water. The widespread application of deep learning further intensifies the demand for large amounts of annotated data. Therefore, it is urgent and necessary to propose a framework that is able to reduce the requirement on the amount of data. In this paper, we propose a novel semisupervised learning framework to infer the 3D structure from the given scene. First, semantic information is employed to make the depth inference more accurate. Second, we make both the depth estimation and semantic segmentation coarse-to-fine frameworks; thus, the depth estimation can be gradually guided by semantic segmentation. We compare our model with state-of-the-art methods. The experimental results demonstrate that our method is better than many supervised learning-based methods, which proves the effectiveness of the proposed method.
引用
收藏
页码:1098 / 1106
页数:8
相关论文
共 36 条
[1]  
Saxena A(2009)Make3D: Learning 3D scene structure from a single still image IEEE Trans Pattern Anal Mach Intell 31 824-840
[2]  
Min Sun A(2012)Toward holistic scene understanding: Feedback enabled cascaded classification models IEEE Trans Pattern Anal Mach Intell 34 1394-1408
[3]  
Ng AY(2018)A brief introduction to weakly supervised learning Natl Sci Rev 5 44-53
[4]  
Li C(2010)A theory of learning from different domains Mach Learn 79 151-175
[5]  
Kowdle A(2020)A survey of syntactic-semantic parsing based on constituent and dependency structures Sci China Tech Sci 63 1898-1920
[6]  
Saxena A(2019)A statistical parsimony method for uncertainty quantification of FDTD computation based on the PCA and ridge regression IEEE Trans Antennas Propagat 67 4726-4737
[7]  
Zhou Z H(2018)An adaptive least angle regression method for uncertainty quantification in FDTD computation IEEE Trans Antennas Propagat 66 7188-7197
[8]  
Ben-David S(2020)Recent advances in deep learning based sentiment analysis Sci China Tech Sci 63 1947-1970
[9]  
Blitzer J(2020)Representation learning in discourse parsing: A survey Sci China Tech Sci 63 1921-1946
[10]  
Crammer K(2020)Recent advances and challenges in task-oriented dialog systems Sci China Tech Sci 63 2011-2027