A rail defect detection framework under class-imbalanced conditions based on improved you only look once network

被引:0
作者
Ding, Yu [2 ,3 ]
Zhao, Qin [2 ,3 ]
Li, Tianhao [2 ,3 ]
Lu, Chen [1 ,2 ,3 ]
Tao, Laifa [1 ,2 ,3 ]
Ma, Jian [1 ,2 ,3 ]
机构
[1] Beihang Univ, Hangzhou Int Innovat Inst, Hangzhou 311115, Peoples R China
[2] Beihang Univ, Sch Reliabil & Syst Engn, Beijing 100191, Peoples R China
[3] Sci & Technol Reliabil & Environm Engn Lab, Beijing 100191, Peoples R China
基金
中国国家自然科学基金;
关键词
Rail defect detection; Class-imbalanced condition; Diffusion model; You only look once network; Attention mechanism; Feature fusion;
D O I
10.1016/j.engappai.2024.109351
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In real rail operations, defects that can lead to serious accidents occur at very low frequencies, resulting in sample scarcity and class imbalances in rail defect datasets. Under imbalanced conditions, rail defect detection models tend to be biased toward majority classes and ignore minority classes, which further leads to inaccurate defect detection results. Therefore, a two-stage rail defect detection framework based on a latent diffusion model (LDM) and an improved You Only Look Once (YOLO) network operating under imbalanced conditions is proposed. This framework aims to enhance the detection performance achieved on imbalanced defect datasets through data augmentation and model improvements. First, the LDM is used to generate many defects with extremely small sample sizes and provide high-quality generated samples to expand the original imbalanced dataset. Furthermore, a coordinate attention module and a feature fusion module are integrated into the original YOLO version 8 (YOLOv8) model to improve its detection capabilities on imbalanced datasets. The coordinate attention mechanism enhances its focus on the positional information of various defects, whereas the feature fusion module enhances its ability to fuse the multiscale features of different defects. The results of the case study demonstrate that sample generation and filtration can provide high-quality samples for dataset augmentation purposes, alleviating the impact of minority defects on the overall detection accuracy. The results of the comparison and ablation experiments show that the improved YOLOv8 model has better detection performance than that of the comparison methods due to its introduction of coordinate attention and feature fusion modules.
引用
收藏
页数:14
相关论文
共 48 条
[1]  
Austin J, 2021, ADV NEUR IN
[2]   Cascade R-CNN: High Quality Object Detection and Instance Segmentation [J].
Cai, Zhaowei ;
Vasconcelos, Nuno .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (05) :1483-1498
[3]  
Chen NX, 2020, Arxiv, DOI arXiv:2009.00713
[4]   Object detection using YOLO: challenges, architectural successors, datasets and applications [J].
Diwan, Tausif ;
Anirudh, G. ;
Tembhurne, Jitendra, V .
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (06) :9243-9275
[5]  
Faghih-Roohi S, 2016, IEEE IJCNN, P2584, DOI 10.1109/IJCNN.2016.7727522
[6]   Research on deep learning method for rail surface defect detection [J].
Feng, Jiang Hua ;
Yuan, Hao ;
Hu, Yun Qing ;
Lin, Jun ;
Liu, Shi Wang ;
Luo, Xiao .
IET ELECTRICAL SYSTEMS IN TRANSPORTATION, 2020, 10 (04) :436-442
[7]  
Fu C. Y., 2017, arXiv
[8]   Non-contact detection of railhead defects and their classification by using convolutional neural network [J].
Ghafoor, Imran ;
Tse, Peter W. ;
Munir, Nauman ;
Trappey, Amy J. C. .
OPTIK, 2022, 253
[9]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[10]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587