Apply Fuzzy Mask to Improve Monocular Depth Estimation

被引:1
作者
Chen, Hsuan [1 ]
Chen, Hsiang-Chieh [2 ]
Sun, Chung-Hsun [3 ]
Wang, Wen-June [1 ]
机构
[1] Natl Cent Univ, Dept Elect Engn, 300 Zhongda Rd, Taoyuan 320317, Taiwan
[2] Natl Cent Univ, Dept Mech Engn, 300 Zhongda Rd, Taoyuan 320317, Taiwan
[3] Natl Kaohsiung Univ Sci & Technol, Dept Elect Engn, 415 Jiangong Rd, Kaohsiung 807618, Taiwan
关键词
Fuzzy mask; Binocular overlap; Monocular depth estimation; Deep learning;
D O I
10.1007/s40815-023-01657-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A fuzzy mask applied to pixel-wise dissimilarity weighting is proposed to improve the monocular depth estimation in this study. The parameters in the monocular depth estimation model are learned unsupervised through the image reconstruction of binocular images. The significant reconstructed dissimilarity, which is challenging to reduce, always occurs at pixels outside the binocular overlap. The fuzzy mask is designed based on the binocular overlap to adjust the weight of the dissimilarity for each pixel. More than 68% of pixels with significant dissimilarity outside binocular overlap are suppressed with weights less than 0.5. The model with the proposed fuzzy mask would focus on learning the depth estimation for pixels within binocular overlap. Experiments on the KITTI dataset show that the inference of the fuzzy mask only increases the training time of the model by less than 1%, while the number of pixels whose depth is accurately estimated enhances, and the monocular depth estimation also improves.
引用
收藏
页码:1143 / 1157
页数:15
相关论文
共 36 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]   New Extension of Fuzzy-Weighted Zero-Inconsistency and Fuzzy Decision by Opinion Score Method Based on Cubic Pythagorean Fuzzy Environment: A Benchmarking Case Study of Sign Language Recognition Systems [J].
Alamoodi, A. H. ;
Albahri, O. S. ;
Zaidan, A. A. ;
AlSattar, H. A. ;
Ahmed, Mohamed A. ;
Pamucar, Dragan ;
Zaidan, B. B. ;
Albahri, A. S. ;
Mahmoud, Mohammed S. .
INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2022, 24 (04) :1909-1926
[3]  
Bian JW, 2019, ADV NEUR IN, V32
[4]  
Chang Shu, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12364), P572, DOI 10.1007/978-3-030-58529-7_34
[5]   Multi-Step Data Prediction in Wireless Sensor Networks Based on One-Dimensional CNN and Bidirectional LSTM [J].
Cheng, Hongju ;
Xie, Zhe ;
Shi, Yushi ;
Xiong, Naixue .
IEEE ACCESS, 2019, 7 :117883-117896
[6]   Development of Effective Methods for Structural Image Recognition Using the Principles of Data Granulation and Apparatus of Fuzzy Logic [J].
Daradkeh, Yousef Ibrahim ;
Tvoroshenko, Iryna ;
Gorokhovatskyi, Volodymyr ;
Latiff, Liza Abdul ;
Ahmad, Norulhusna .
IEEE ACCESS, 2021, 9 :13417-13428
[7]  
Eigen D, 2014, ADV NEUR IN, V27
[8]  
Geiger A, 2012, PROC CVPR IEEE, P3354, DOI 10.1109/CVPR.2012.6248074
[9]   Digging Into Self-Supervised Monocular Depth Estimation [J].
Godard, Clement ;
Mac Aodha, Oisin ;
Firman, Michael ;
Brostow, Gabriel .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3827-3837
[10]   Unsupervised Monocular Depth Estimation with Left-Right Consistency [J].
Godard, Clement ;
Mac Aodha, Oisin ;
Brostow, Gabriel J. .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6602-6611