Pothole detection-you only look once: Deformable convolution based road pothole detection

被引:1
作者
Tang, Pei [1 ,2 ]
Lv, Mao [1 ,2 ]
Ding, Zhenyu [1 ,2 ]
Xu, Weikai [1 ,2 ]
Jiang, Minnan [1 ,2 ]
机构
[1] Yancheng Inst Technol, Coll Automot Engn, Yancheng, Peoples R China
[2] Yancheng Inst Technol, Jiangsu Coastal New Energy Vehicle Res Inst, Yancheng, Peoples R China
关键词
image capture; image classification; image sampling;
D O I
10.1049/ipr2.13300
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The detection of road potholes plays a crucial role in ensuring passenger comfort and the structural safety of vehicles. To address the challenges of pothole detection in complex road environments, this paper proposes a model focusing on shape features (pothole detection you only look once, PD-YOLO). The model aims to overcome the limitations of multi-scale feature learning caused by the use of fixed convolutional kernels in the baseline model, by constructing a feature extraction module that better adapts to variations in the shape of potholes. Subsequently, a cross-stage partial network was designed using a one-time aggregation method, simplifying the model while enabling the network to fuse information between feature maps at different stages. Additionally, a dynamic sparse attention mechanism is introduced to select relevant features, reducing redundancy and suppressing background noise. Experiments conducted on the VOC2007 and GRDDC2020_Pothole datasets reveal that compared to the baseline model YOLOv8, PD-YOLO achieves improvements of 3.9% and 2.8% in mean average precision, with a frame rate of approximately 290 frames per second, effectively meeting the accuracy and real-time requirements for pothole detection. The code and dataset for this paper are located at: .
引用
收藏
页数:13
相关论文
共 30 条
[21]   DS-YOLOv8-Based Object Detection Method for Remote Sensing Images [J].
Shen, Lingyun ;
Lang, Baihe ;
Song, Zhengxun .
IEEE ACCESS, 2023, 11 :125122-125137
[22]  
Singh S., 2024, Manufacturing Technologies and Production Systems, P171
[23]   Thermal radiation image recognition camera using target detection techniques with human computer interaction [J].
Tan, Juan ;
He, Jin .
JOURNAL OF RADIATION RESEARCH AND APPLIED SCIENCES, 2024, 17 (03)
[24]   UAV-YOLOv8: A Small-Object-Detection Model Based on Improved YOLOv8 for UAV Aerial Photography Scenarios [J].
Wang, Gang ;
Chen, Yanfei ;
An, Pei ;
Hong, Hanyu ;
Hu, Jinghu ;
Huang, Tiange .
SENSORS, 2023, 23 (16)
[25]   Upgrade your network in-place with deformable convolution [J].
Xi, Wei ;
Sun, Li ;
Sun, Jun .
2020 19TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS FOR BUSINESS ENGINEERING AND SCIENCE (DCABES 2020), 2020, :239-242
[26]   Road Surface Defect Detection-From Image-Based to Non-Image-Based: A Survey [J].
Yu, Jongmin ;
Jiang, Jiaqi ;
Fichera, Sebastiano ;
Paoletti, Paolo ;
Layzell, Lisa ;
Mehta, Devansh ;
Luo, Shan .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (09) :10581-10603
[27]   AAL-Net: A Lightweight Detection Method for Road Surface Defects Based on Attention and Data Augmentation [J].
Zhang, Cheng ;
Li, Gang ;
Zhang, Zekai ;
Shao, Rui ;
Li, Min ;
Han, Delong ;
Zhou, Mingle .
APPLIED SCIENCES-BASEL, 2023, 13 (03)
[28]   MED-YOLOv8s: a new real-time road crack, pothole, and patch detection model [J].
Zhao, Minghu ;
Su, Yaoheng ;
Wang, Jiuxin ;
Liu, Xinru ;
Wang, Kaihang ;
Liu, Zishen ;
Liu, Man ;
Guo, Zhou .
JOURNAL OF REAL-TIME IMAGE PROCESSING, 2024, 21 (02)
[29]   BiFormer: Vision Transformer with Bi-Level Routing Attention [J].
Zhu, Lei ;
Wang, Xinjiang ;
Ke, Zhanghan ;
Zhang, Wayne ;
Lau, Rynson .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :10323-10333
[30]   Deformable ConvNets v2: More Deformable, Better Results [J].
Zhu, Xizhou ;
Hu, Han ;
Lin, Stephen ;
Dai, Jifeng .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :9300-9308