Use of CNNs for Estimating Depth from Stereo Images

Cited by: 0
Authors
Satushe, Vaidehi [1]
Vyas, Vibha [1]
Affiliation
[1] COEP Technol Univ COEP Tech, Dept Elect & Telecommun Engn, Pune, Maharashtra, India
Source
SMART TRENDS IN COMPUTING AND COMMUNICATIONS, VOL 1, SMARTCOM 2024 | 2024 / Vol. 945
Keywords
Winner takes all (WTA); Disparity space image (DSI); Free viewpoint television (FVT); Cross-based cost aggregation; FIELDS;
DOI
10.1007/978-981-97-1320-2_5
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This research investigates the advantages of using convolutional neural networks (CNNs) to generate the disparity space image (DSI) from which disparity maps are built for stereo image pairs. An eight-layer fully connected network with a total of 3100 neurons is trained on 220,000 positive and negative image-patch examples. The DSI is aggregated using a context-based technique, and the disparity map is then obtained with the winner-takes-all (WTA) technique. Compared with the simple subtractive plane-sweep technique, the results show noticeable visual improvements, particularly in regions with little to no texture.
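The baseline pipeline named in the abstract (a subtractive plane sweep that fills a disparity space image, followed by winner-takes-all selection) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `build_dsi` and `winner_takes_all`, the absolute-difference cost, the single-row toy images, and the omission of cost aggregation are all assumptions made for illustration. In the paper's approach, a learned CNN matching cost would replace the subtraction step.

```python
def build_dsi(left, right, max_disp):
    """Subtractive plane sweep (illustrative): cost[d][y][x] = |left[y][x] - right[y][x - d]|."""
    height, width = len(left), len(left[0])
    inf = float("inf")
    dsi = []
    for d in range(max_disp + 1):
        plane = []
        for y in range(height):
            row = []
            for x in range(width):
                # Pixels with no counterpart in the right image get infinite cost.
                row.append(abs(left[y][x] - right[y][x - d]) if x - d >= 0 else inf)
            plane.append(row)
        dsi.append(plane)
    return dsi


def winner_takes_all(dsi):
    """For each pixel, pick the disparity whose cost is lowest."""
    height, width = len(dsi[0]), len(dsi[0][0])
    return [
        [min(range(len(dsi)), key=lambda d: dsi[d][y][x]) for x in range(width)]
        for y in range(height)
    ]


# Toy pair (assumed data): the right view is the left view shifted by 2 pixels.
left = [[0, 0, 10, 50, 90, 30, 70, 20]]
right = [[10, 50, 90, 30, 70, 20, 0, 0]]
disparity = winner_takes_all(build_dsi(left, right, max_disp=3))
```

On this toy pair, every pixel that has a valid match two columns to the left is assigned disparity 2. A per-pixel cost like this one is ambiguous in textureless regions, which is the case the abstract reports the CNN-based cost improving.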
Pages: 45-58
Page count: 14