Graph Attention Layer Evolves Semantic Segmentation for Road Pothole Detection: A Benchmark and Algorithms

被引:55
作者
Fan, Rui [1 ,2 ]
Wang, Hengli [3 ]
Wang, Yuan [4 ]
Liu, Ming [3 ]
Pitas, Ioannis [5 ]
机构
[1] Tongji Univ, Dept Control Sci & Engn, Coll Elect & Informat Engn, Shanghai 201804, Peoples R China
[2] Shanghai Res Inst Intelligent Autonomous Syst, Shanghai 201210, Peoples R China
[3] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China
[4] SmartMore, Ind Res & Dev Ctr, Shenzhen 518000, Peoples R China
[5] Univ Thessaloniki, Sch Informat, Thessaloniki 54124, Greece
关键词
Roads; Image segmentation; Semantics; Convolutional neural networks; Feature extraction; Computer architecture; Benchmark testing; Road pothole detection; machine learning; convolutional neural network; graph neural network;
D O I
10.1109/TIP.2021.3112316
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing road pothole detection approaches can be classified as computer vision-based or machine learning-based. The former approaches typically employ 2D image analysis/ understanding or 3D point cloud modeling and segmentation algorithms to detect (i.e., recognize and localize) road potholes from vision sensor data, e.g., RGB images and/or depth/disparity images. The latter approaches generally address road pothole detection using convolutional neural networks (CNNs) in an end-to-end manner. However, road potholes are not necessarily ubiquitous and it is challenging to prepare a large well-annotated dataset for CNN training. In this regard, while computer vision-based methods were the mainstream research trend in the past decade, machine learning-based methods were merely discussed. Recently, we published the first stereo vision-based road pothole detection dataset and a novel disparity transformation algorithm, whereby the damaged and undamaged road areas can be highly distinguished. However, there are no benchmarks currently available for state-of-the-art (SoTA) CNNs trained using either disparity images or transformed disparity images. Therefore, in this paper, we first discuss the SoTA CNNs designed for semantic segmentation and evaluate their performance for road pothole detection with extensive experiments. Additionally, inspired by graph neural network (GNN), we propose a novel CNN layer, referred to as graph attention layer (GAL), which can be easily deployed in any existing CNN to optimize image feature representations for semantic segmentation. Our experiments compare GAL-DeepLabv3+, our best-performing implementation, with nine SoTA CNNs on three modalities of training data: RGB images, disparity images, and transformed disparity images. The experimental results suggest that our proposed GAL-DeepLabv3+ achieves the best overall pothole detection accuracy on all training data modalities. The source code, dataset, and benchmark are publicly available at mias.group/GAL-Pothole-Detection.
引用
收藏
页码:8144 / 8154
页数:11
相关论文
共 39 条
[1]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[2]  
Battaglia Peter W., 2018, Relational inductive biases, deep learning, and graph networks
[3]  
Chen L C., 2017, RETHINKING ATROUS CO, P1
[4]   CaMap: Camera-based Map Manipulation on Mobile Devices [J].
Chen, Liang ;
Chen, Dongyi .
PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE2018), 2018,
[5]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[6]   The Cityscapes Dataset for Semantic Urban Scene Understanding [J].
Cordts, Marius ;
Omran, Mohamed ;
Ramos, Sebastian ;
Rehfeld, Timo ;
Enzweiler, Markus ;
Benenson, Rodrigo ;
Franke, Uwe ;
Roth, Stefan ;
Schiele, Bernt .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223
[7]   Pothole Detection Using Computer Vision and Learning [J].
Dhiman, Amita ;
Klette, Reinhard .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (08) :3536-3550
[8]   Pavement distress detection and classification based on YOLO network [J].
Du, Yuchuan ;
Pan, Ning ;
Xu, Zihao ;
Deng, Fuwen ;
Shen, Yu ;
Kang, Hua .
INTERNATIONAL JOURNAL OF PAVEMENT ENGINEERING, 2021, 22 (13) :1659-1672
[9]   Rethinking Road Surface 3-D Reconstruction and Pothole Detection: From Perspective Transformation to Disparity Map Segmentation [J].
Fan, Rui ;
Ozgunalp, Umar ;
Wang, Yuan ;
Liu, Ming ;
Pitas, Ioannis .
IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (07) :5799-5808
[10]   Road Damage Detection Based on Unsupervised Disparity Map Segmentation [J].
Fan, Rui ;
Liu, Ming .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (11) :4906-4911