Deep semantic segmentation for visual scene understanding of soil types

被引:15
作者
Zamani, Vahid [1 ]
Taghaddos, Hosein [1 ]
Gholipour, Yaghob [1 ]
Pourreza, Hamidreza [2 ]
机构
[1] Univ Tehran, Coll Engn, Sch Civil Engn, Tehran, Iran
[2] Ferdowsi Univ Mashhad, Dept Comp Engn, Mashhad, Iran
关键词
Scene understanding; Soil classification; Visual soil assessment; Context-aware system; Computer vision; Deep learning; Deeplab v3+; Semantic segmentation; Image segmentation; Project management; IMAGE; CLASSIFICATION;
D O I
10.1016/j.autcon.2022.104342
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
One of the state-of-the-art computer vision applications is scene understanding and visual contextual awareness. Despite the numerous detection and classification-based studies, the literature lacks semantic segmentation methods for a more comprehensive and precise understanding of the soil included scene due to the scarcity of annotated datasets; the extracted information from an understood scene is worthwhile in project fleet management, claims management, equipment productivity analysis, safety, and soil classification. Hence, this study presents a vision-based approach for soil-included scene understanding and classifying them into different categories according to ASTM D2488, using semantic segmentation. An annotated dataset of various soil types containing 3043 images was developed to train four Deeplab v3+ variants with modified decoders. Five-fold cross-validation indicates the remarkable performance of the best variant with a mean Jaccard index of 0.9. The application and effects of subpixel upsampling and exit-flow CRF were also examined.
引用
收藏
页数:21
相关论文
共 78 条
[1]  
Aitken A. P., 2017, Checkerboard artifact free sub-pixel convolution: A note on sub-pixel convolution, resize convolution and convolution resize
[2]  
[Anonymous], 2017, ASTM STANDARDS TESTI, P1, DOI DOI 10.1520/D2488-17E01
[3]  
[Anonymous], 1999, SOIL TAX BAS SYST SO, V2nd, pWashington, DOI [DOI 10.1017/S0016756800045489, DOI 10.1111/J.1475-2743.2001.TB00008.X]
[4]  
[Anonymous], 2017, STANDARD SPECIFICATI
[5]   Scene understanding in construction and buildings using image processing methods: A comprehensive review and a case study [J].
Arashpour, Mehrdad ;
Tuan Ngo ;
Li, Heng .
JOURNAL OF BUILDING ENGINEERING, 2021, 33 (33)
[6]   Vision-based integrated mobile robotic system for real-time applications in construction [J].
Asadi, Khashayar ;
Ramshankar, Hariharan ;
Pullagurla, Harish ;
Bhandare, Aishwarya ;
Shanbhag, Suraj ;
Mehta, Pooja ;
Kundu, Spondon ;
Han, Kevin ;
Lobaton, Edgar ;
Wu, Tianfu .
AUTOMATION IN CONSTRUCTION, 2018, 96 :470-482
[7]   Deep semantic segmentation of natural and medical images: a review [J].
Asgari Taghanaki, Saeid ;
Abhishek, Kumar ;
Cohen, Joseph Paul ;
Cohen-Adad, Julien ;
Hamarneh, Ghassan .
ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (01) :137-178
[8]   Image segmentation of underfloor scenes using a mask regions convolutional neural network with two-stage transfer learning [J].
Atkinson, Gary A. ;
Zhang, Wenhao ;
Hansen, Mark F. ;
Holloway, Mathew L. ;
Napier, Ashley A. .
AUTOMATION IN CONSTRUCTION, 2020, 113
[9]  
Bell F.G., 1992, ENG PROPERTIES SOILS, VThird, P1, DOI [10.1016/B978-0-7506-0489-5.50004-4, DOI 10.1016/B978-0-7506-0489-5.50004-4]
[10]   A systematic study of the class imbalance problem in convolutional neural networks [J].
Buda, Mateusz ;
Maki, Atsuto ;
Mazurowski, Maciej A. .
NEURAL NETWORKS, 2018, 106 :249-259