Attention-Based Multi-Scale Convolutional Neural Network (A plus MCNN) for Multi-Class Classification in Road Images

被引:40
作者
Eslami, Elham [1 ]
Yun, Hae-Bum [1 ]
机构
[1] Univ Cent Florida, Civil Environm & Construct Engn Dept, Orlando, FL 32816 USA
关键词
smart infrastructure assessment; road safety; automated pavement condition assessment; convolutional neural network; deep learning; PAVEMENT CRACK DETECTION; OBJECT DETECTION; SCENE;
D O I
10.3390/s21155137
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Automated pavement distress recognition is a key step in smart infrastructure assessment. Advances in deep learning and computer vision have improved the automated recognition of pavement distresses in road surface images. This task remains challenging due to the high variation of defects in shapes and sizes, demanding a better incorporation of contextual information into deep networks. In this paper, we show that an attention-based multi-scale convolutional neural network (A+MCNN) improves the automated classification of common distress and non-distress objects in pavement images by (i) encoding contextual information through multi-scale input tiles and (ii) employing a mid-fusion approach with an attention module for heterogeneous image contexts from different input scales. A+MCNN is trained and tested with four distress classes (crack, crack seal, patch, pothole), five non-distress classes (joint, marker, manhole cover, curbing, shoulder), and two pavement classes (asphalt, concrete). A+MCNN is compared with four deep classifiers that are widely used in transportation applications and a generic CNN classifier (as the control model). The results show that A+MCNN consistently outperforms the baselines by 1 similar to 26% on average in terms of the F-score. A comprehensive discussion is also presented regarding how these classifiers perform differently on different road objects, which has been rarely addressed in the existing literature.
引用
收藏
页数:27
相关论文
共 70 条
[1]   Automatic Pixel-Level Pavement Crack Detection Using Information of Multi-Scale Neighborhoods [J].
Ai, Dihao ;
Jiang, Guiyuan ;
Kei, Lam Siew ;
Li, Chengwu .
IEEE ACCESS, 2018, 6 :24452-24463
[2]  
Anand S., 2018, PROC IEEE INT C POWE, P1
[3]  
[Anonymous], ARXIV150204623
[4]  
[Anonymous], 2021, ASCES 2021 INFR REP
[5]  
[Anonymous], ARXIV150804025
[6]  
[Anonymous], ARXIV14127062
[7]  
[Anonymous], 2018, PROC CVPR IEEE, DOI DOI 10.1109/TPAMI.2019.2913372
[8]  
[Anonymous], 2014, NEURAL INFORM PROCES
[9]  
Ba J., ARXIV14127755
[10]   Encoder-decoder network for pixel-level road crack detection in black-box images [J].
Bang, Seongdeok ;
Park, Somin ;
Kim, Hongjo ;
Kim, Hyoungkwan .
COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2019, 34 (08) :713-727