Pothole detection-you only look once: Deformable convolution based road pothole detection

被引：1

作者：

Tang, Pei ^{[1
,2
]}

Lv, Mao ^{[1
,2
]}

Ding, Zhenyu ^{[1
,2
]}

Xu, Weikai ^{[1
,2
]}

Jiang, Minnan ^{[1
,2
]}

机构：

[1] Yancheng Inst Technol, Coll Automot Engn, Yancheng, Peoples R China

[2] Yancheng Inst Technol, Jiangsu Coastal New Energy Vehicle Res Inst, Yancheng, Peoples R China

来源：

IET IMAGE PROCESSING | 2025年 / 19卷 / 01期

关键词：

image capture; image classification; image sampling;

D O I：

10.1049/ipr2.13300

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The detection of road potholes plays a crucial role in ensuring passenger comfort and the structural safety of vehicles. To address the challenges of pothole detection in complex road environments, this paper proposes a model focusing on shape features (pothole detection you only look once, PD-YOLO). The model aims to overcome the limitations of multi-scale feature learning caused by the use of fixed convolutional kernels in the baseline model, by constructing a feature extraction module that better adapts to variations in the shape of potholes. Subsequently, a cross-stage partial network was designed using a one-time aggregation method, simplifying the model while enabling the network to fuse information between feature maps at different stages. Additionally, a dynamic sparse attention mechanism is introduced to select relevant features, reducing redundancy and suppressing background noise. Experiments conducted on the VOC2007 and GRDDC2020_Pothole datasets reveal that compared to the baseline model YOLOv8, PD-YOLO achieves improvements of 3.9% and 2.8% in mean average precision, with a frame rate of approximately 290 frames per second, effectively meeting the accuracy and real-time requirements for pothole detection. The code and dataset for this paper are located at: .

引用

页数：13

共 30 条

[21] DS-YOLOv8-Based Object Detection Method for Remote Sensing Images [J].

Shen, Lingyun ;

Lang, Baihe ;

Song, Zhengxun .

IEEE ACCESS, 2023, 11 :125122-125137

[22]

Singh S., 2024, Manufacturing Technologies and Production Systems, P171

[23] Thermal radiation image recognition camera using target detection techniques with human computer interaction [J].

Tan, Juan ;

He, Jin .

JOURNAL OF RADIATION RESEARCH AND APPLIED SCIENCES, 2024, 17 (03)

[24] UAV-YOLOv8: A Small-Object-Detection Model Based on Improved YOLOv8 for UAV Aerial Photography Scenarios [J].

Wang, Gang ;

Chen, Yanfei ;

An, Pei ;

Hong, Hanyu ;

Hu, Jinghu ;

Huang, Tiange .

SENSORS, 2023, 23 (16)

[25] Upgrade your network in-place with deformable convolution [J].

Xi, Wei ;

Sun, Li ;

Sun, Jun .

2020 19TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS FOR BUSINESS ENGINEERING AND SCIENCE (DCABES 2020), 2020, :239-242

[26] Road Surface Defect Detection-From Image-Based to Non-Image-Based: A Survey [J].

Yu, Jongmin ;

Jiang, Jiaqi ;

Fichera, Sebastiano ;

Paoletti, Paolo ;

Layzell, Lisa ;

Mehta, Devansh ;

Luo, Shan .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (09) :10581-10603

[27] AAL-Net: A Lightweight Detection Method for Road Surface Defects Based on Attention and Data Augmentation [J].

Zhang, Cheng ;

Li, Gang ;

Zhang, Zekai ;

Shao, Rui ;

Li, Min ;

Han, Delong ;

Zhou, Mingle .

APPLIED SCIENCES-BASEL, 2023, 13 (03)

[28] MED-YOLOv8s: a new real-time road crack, pothole, and patch detection model [J].

Zhao, Minghu ;

Su, Yaoheng ;

Wang, Jiuxin ;

Liu, Xinru ;

Wang, Kaihang ;

Liu, Zishen ;

Liu, Man ;

Guo, Zhou .

JOURNAL OF REAL-TIME IMAGE PROCESSING, 2024, 21 (02)

[29] BiFormer: Vision Transformer with Bi-Level Routing Attention [J].

Zhu, Lei ;

Wang, Xinjiang ;

Ke, Zhanghan ;

Zhang, Wayne ;

Lau, Rynson .

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :10323-10333

[30] Deformable ConvNets v2: More Deformable, Better Results [J].

Zhu, Xizhou ;

Hu, Han ;

Lin, Stephen ;

Dai, Jifeng .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :9300-9308

← 1 2 3 →