Multiscale Attention Networks for Pavement Defect Detection

被引:44
作者
Chen, Junde [1 ]
Wen, Yuxin [1 ]
Nanehkaran, Yaser Ahangari [2 ]
Zhang, Defu [3 ]
Zeb, Adnan [4 ]
机构
[1] Chapman Univ, Dale E & Sarah Ann Fowler Sch Engn, Orange, CA 92866 USA
[2] Yancheng Teachers Univ, Sch Informat Engn, Yancheng 224000, Peoples R China
[3] Xiamen Univ, Sch Informat, Xiamen 361005, Peoples R China
[4] Southern Univ Sci & Technol, Coll Engn, Shenzhen 518000, Peoples R China
关键词
Attention module; deep neural network; image identification; multiscale convolution; pavement defect detection; CRACK DETECTION; SYSTEM;
D O I
10.1109/TIM.2023.3298391
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Pavement defects such as cracks, net cracks, and pit slots can cause potential traffic safety problems. The timely detection and identification play a key role in reducing the harm of various pavement defects. Particularly, the recent development in deep learning (DL)-based convolution neural networks (CNNs) has shown competitive performance in image detection and classification. To detect pavement defects automatically and improve effects, a multiscale mobile attention-based network, which we termed MANet, is proposed to perform the detection of pavement defects. The architecture of the encoder-decoder is used in MANet, where the encoder adopts the MobileNet as the backbone network to extract pavement defect features. Instead of the original 3 x 3 convolution, the multiscale convolution kernels are used in depthwise separable convolution (DSConv) layers of the network. Furthermore, the hybrid attention mechanism is separately incorporated into the encoder and decoder modules to infer the significance of spatial points and interchannel relationship features for the input intermediate feature maps. The proposed approach achieves state-of-the-art performance on two publicly available benchmark datasets, i.e., the Crack500 (500 crack images with 2000 x 1500 pixels) and CFD (118 crack images with 480 x 320 pixels) datasets. The mean intersection over union (MIoU) of the proposed approach on these two datasets reaches 0.7219 and 0.7788, respectively. Ablation experiments show that the multiscale convolution and hybrid attention modules can effectively help the model extract high-level feature representations and generate more accurate pavement crack segmentation results. We further test the model on locally collected pavement crack images (131 images with 1024 x 768 pixels) and it achieves a satisfactory result. The proposed approach realizes the MIoU of 0.6514 on the local dataset and outperforms other compared baseline methods. Experimental findings demonstrate the validity and feasibility of the proposed approach and it provides a viable solution for pavement crack detection in practical application scenarios. Our code is available at https://github.com/xtu502/pavement-defects.
引用
收藏
页数:12
相关论文
共 46 条
[1]   Attention-based generative adversarial network with internal damage segmentation using thermography [J].
Ali, Rahmat ;
Cha, Young-Jin .
AUTOMATION IN CONSTRUCTION, 2022, 141
[2]   Evaluating pavement cracks with bidimensional empirical mode decomposition [J].
Ayenu-Prah, Albert ;
Attoh-Okine, Nii .
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2008, 2008 (1)
[3]   Attention Augmented Convolutional Networks [J].
Bello, Irwan ;
Zoph, Barret ;
Vaswani, Ashish ;
Shlens, Jonathon ;
Le, Quoc V. .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3285-3294
[4]   Autonomous Structural Visual Inspection Using Region-Based Deep Learning for Detecting Multiple Damage Types [J].
Cha, Young-Jin ;
Choi, Wooram ;
Suh, Gahyun ;
Mahmoudkhani, Sadegh ;
Buyukozturk, Oral .
COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2018, 33 (09) :731-747
[5]   Deep Learning-Based Crack Damage Detection Using Convolutional Neural Networks [J].
Cha, Young-Jin ;
Choi, Wooram ;
Buyukozturk, Oral .
COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2017, 32 (05) :361-378
[6]  
Chanda Sukalpa, 2014, Artificial Neural Networks in Pattern Recognition. 6th IAPR TC 3 International Workshop, ANNPR 2014. Proceedings: LNCS 8774, P193, DOI 10.1007/978-3-319-11656-3_18
[7]   Data-Driven Fault Detection for Dynamic Systems With Performance Degradation: A Unified Transfer Learning Framework [J].
Chen, Hongtian ;
Chai, Zheng ;
Jiang, Bin ;
Huang, Biao .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70 (70)
[8]   Block-based automatic road defect recognition approach [J].
Chen, Junde ;
Ul Haq, Anwar ;
Zhang, Defu .
JOURNAL OF ELECTRONIC IMAGING, 2019, 28 (05)
[9]   Pavement crack detection using hessian structure propagation [J].
Chen, Qi ;
Huang, Yuchun ;
Sun, Hui ;
Huang, Weihong .
ADVANCED ENGINEERING INFORMATICS, 2021, 49
[10]   SDDNet: Real-Time Crack Segmentation [J].
Choi, Wooram ;
Cha, Young-Jin .
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2020, 67 (09) :8016-8025