Pavement Distress Detection Based on Faster R-CNN and Morphological Operations

被引:0
作者
Yan B.-F. [1 ,2 ]
Xu G.-Y. [1 ]
Luan J. [3 ,4 ]
Lin D. [4 ]
Deng L. [1 ,2 ]
机构
[1] School of Civil Engineering, Hunan University, Changsha
[2] Key Laboratory for Wind and Bridge Engineering of Hunan Province, Hunan University, Changsha
[3] School of Material Science and Engineering, Central South University of Forestry and Technology, Changsha
[4] Yunnan Aerospace Engineering Geophysical Detecting Co. Ltd., Kunming
来源
Zhongguo Gonglu Xuebao/China Journal of Highway and Transport | 2021年 / 34卷 / 09期
基金
中国国家自然科学基金;
关键词
Distress detection; Faster R-CNN; Morphological operation; Pavement; Road engineering;
D O I
10.19721/j.cnki.1001-7372.2021.09.015
中图分类号
学科分类号
摘要
To improve the efficiency and accuracy of image-based pavement distress detection as well as quickly identify the type, location, and magnitude of the distress, the Faster R-CNN algorithm for object detection is introduced. A convolutional neural network (CNN) based on VGG16 migration learning and model fine tuning was employed in an extracted crack area with a bounding frame to locate the crack skeleton with a 50% overlap sliding window. Then, morphological operations were conducted to extract the crack skeleton and calculate its length and width. To improve the performance of Faster R-CNN and evaluate the effectiveness of the integrated algorithms where a low misdetection rate but high false-detection rate is likely, the precision, recall, and F1 score were introduced. The maximum F1 score was used to determine the pixel area of the distress frame and the corresponding confidence threshold thus reducing the false detection rate and adapting to the diverse scenarios of pavement surface distress. The rapid pavement distress detection algorithm was applied to an expressway in Guangdong, China. The test results on typical crack sample images show that the proposed method is more efficient than full-field image processing methods such as CNN with sliding window and traditional morphology operations. As the segmented crack bounding frames were merged and adjusted, and the optimized pixel area and confidence threshold for the distress boxes were considered. It was observed that the precision rate of the transverse crack increases from 0.861 to 0.918, whereas the false detection rates of the horizontal and vertical cracks decrease significantly from 20.4% and 23.8% to 8.2% and 6.9% before and after adjustment, respectively. The proposed pavement distress detection method integrating Faster R-CNN, CNN, and morphological operations has the advantages of high efficiency and low misdetection rate. Moreover, the false detection rates are greatly reduced by introducing the evaluation method and the thresholds of pixel area and confidence value, which indicates the engineering application potential of the proposed method. © 2021, Editorial Department of China Journal of Highway and Transport. All right reserved.
引用
收藏
页码:181 / 193
页数:12
相关论文
共 30 条
[1]  
MA Jian, ZHAO Xiang-mo, HE Shuan-hai, Et al., Review of Pavement Detection Technology, Journal of Traffic and Transportation Engineering, 17, 5, pp. 121-137, (2017)
[2]  
LECUN Y, BENGIO Y, HINTON G., Deep Learning, Nature, 521, 7553, pp. 436-444, (2015)
[3]  
LENG B, GUO S, ZHANG X, Et al., 3D Object Retrieval with Stacked Local Convolutional Autoencoder, Signal Processing, 112, pp. 119-128, (2015)
[4]  
SHI B, BAI X, YAO C., Script Identification in the Wild via Discriminative Convolutional Neural Network, Pattern Recognition, 52, pp. 448-458, (2016)
[5]  
BARAT C, DUCOTTET C., String Representations and Distances in Deep Convolutional Neural Networks for Image Classification, Pattern Recognition, 54, pp. 104-115, (2016)
[6]  
LECUN Y, BOSER B E, DENKER J S, Et al., Handwritten Digit Recognition with a Back-propagation Network, Advances in Neural Information Processing Systems, 1, pp. 396-404, (1990)
[7]  
LECUN Y, BOTTOU L, BENGIO Y, Et al., Gradient-based Learning Applied to Document Recognition [J], Proceedings of the IEEE, 86, 11, pp. 2278-2324, (1998)
[8]  
KRIZHEVSKY A, SUTSKEVER I, HINTON G E., ImageNet Classification with Deep Convolutional Neural Networks, Advances in Neural Information Processing Systems, 2, pp. 1097-1105, (2012)
[9]  
SIMONYAN K, ZISSERMAN A., Very Deep Convolutional Networks for Large-scale Image Recognition, Computer Science, 1, pp. 1-14, (2014)
[10]  
HE K, ZHANG X, REN S, Et al., Deep Residual Learning for Image Recognition, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770-778, (2016)