Convolutional neural networks for road surface classification on aerial imagery

被引:0
作者
Pesek, Ondrej [1 ]
Krisztian, Lina [2 ]
Landa, Martin [1 ]
Metz, Markus [2 ]
Neteler, Markus [2 ]
机构
[1] Department of Geomatics, Faculty of Civil Engineering, Czech Technical University in Prague, Prague
[2] Mundialis GmbH & Co. KG, North Rhine-Westphalia, Bonn
关键词
CNN; Convolutional neural network; Land cover detection; Remote sensing; Road surface;
D O I
10.7717/PEERJ-CS.2571
中图分类号
学科分类号
摘要
Any place the human species inhabits is inevitably modified by them. One of the first features that appear everywhere, in urban areas as well as in the countryside or deep forests, are roads. Further, roads and streets in general reflect their omnipresent and significant role in our lives through the flow of goods, people, and even culture and information. However, their contribution to the public is highly influenced by their surface. Yet, research on automated road surface classification from remotely sensed data is peculiarly scarce. This work investigates the capacities of chosen convolutional neural networks (fully convolutional network (FCN), U-Net, SegNet, DeepLabv3+) on this task. We find that convolutional neural network (CNN) are capable of distinguishing between compact (asphalt, concrete) and modular (paving stones, tiles) surfaces for both roads and sidewalks on aerial data of spatial resolution of 10 cm. U-Net proved its position as the best-performing model among the tested ones, reaching an overall accuracy of nearly 92%. Furthermore, we explore the influence of adding a near-infrared band to the basic red green blue (RGB) scenes and stress where it should be used and where avoided. Overfitting strategies such as dropout and data augmentation undergo the same examination and clearly show their pros and cons. Convolutional neural networks are also compared to single-pixel based random forests and show indisputable advantage of the context awareness in convolutional neural networks, U-Net reaching almost 25% higher accuracy than random forests. We conclude that convolutional neural networks and U-Net in particular should be considered as suitable approaches for automated semantic segmentation of road surfaces on aerial imagery, while common overfitting strategies should only be used under particular conditions. © 2024 Pesek et al.
引用
收藏
相关论文
共 66 条
[1]  
Alexakis EB, Armenakis C., Improving CNN-based building semantic segmentation using object boundaries, ISPRS Congress: Imaging Today, Foreseeing Tomorrow, Commission III, volume 43 of International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, pp. 41-48, (2022)
[2]  
Assiss JC, Giacomini HC, Ribeiro MC., Road permeability index: evaluating the heterogeneous permeability of roads for wildlife crossing, Ecological Indicators, 99, pp. 365-374, (2019)
[3]  
Badrinarayanan V, Kendall A, Cipolla R., SegNet: a deep convolutional encoder-decoder architecture for image segmentation, IEEE Transactions on Pattern Analysis and Machine Intelligence, 12, pp. 2481-2495, (2017)
[4]  
Barrington-Leigh C, Millard-Ball A., A century of sprawl in the United States, Proceedings of the National Academy of Sciences of the United States of America, 112, pp. 8244-8249, (2015)
[5]  
Barrington-Leigh C, Millard-Ball A., The world’s user-generated road map is more than 80% complete, PLOS ONE, 12, (2017)
[6]  
Breiman L., Random forests, Machine Learning, 45, 1, pp. 5-32, (2001)
[7]  
Cao H, Gao Y, Cai W, Xu Z, Li L., Segmentation detection method for complex road cracks collected by UAV based on hc-unet++, Drones, 7, 3, (2023)
[8]  
Cao X, Zhang K, Jiao L., Csanet: cross-scale axial attention network for road segmentation, Remote Sensing, 15, 1, (2023)
[9]  
Creative commons attribution 4.0 international, (2013)
[10]  
Global roads open access data set, version 1 (gROADSv1), (2013)