Automatic Railroad Track Components Inspection Using Hybrid Deep Learning Framework

被引:43
作者
Wu, Yunpeng [1 ]
Chen, Ping [2 ]
Qin, Yong [2 ]
Qian, Yu [3 ]
Xu, Fei [4 ]
Jia, Limin [2 ]
机构
[1] Kunming Univ Sci & Technol, Fac Transportat Engn, Kunming 650031, Yunnan, Peoples R China
[2] Beijing Jiaotong Univ, State Key Lab Rail Traff Control & Safety, Beijing 100044, Peoples R China
[3] Univ South Carolina, Dept Civil & Environm Engn, Columbia, SC 29208 USA
[4] Shijiazhuang Tiedao Univ, Sch Safety Engn & Emergency Management, Shijiazhuang 050043, Hebei, Peoples R China
基金
中国国家自然科学基金; 国家自然科学基金重大项目;
关键词
Inspection; Rails; Image segmentation; Fasteners; Safety; IP networks; Feature extraction; Convolutional neural network (CNN); object detection; railroad safety; saliency segmentation; track component inspection; unmanned aerial vehicle (UAV) images; SYSTEM;
D O I
10.1109/TIM.2023.3265636
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Regular inspections on track components, such as the clip, spike, and rail, are essential to maintain track quality and ensure railroad operating safety. Unfortunately, traditional image processing (IP)-based systems have limited accuracy. Existing convolutional neural network (CNN)-based approaches are designed for either detection or saliency segmentation of a specific track component (e.g., fastener or rail only). The overall track condition could not be evaluated because not all the track components are inspected simultaneously. This article presents an all-in-one YOLO (AOYOLO) framework for multitask track component inspection. First, a newly developed ConvNeXt-based backbone is constructed in AOYOLO to produce suitable hyperfeatures for both detection and segmentation tasks. Second, a novel U-shaped salient object segmentation branch is incorporated into AOYOLO to supplement the object detection branch, improving both the rail surface defect (RSD) segmentation and the detection of other components. Advanced data augmentations are integrated to further enhance the accuracy and scalability of the network. Extensive experiments conducted on a track dataset established with images taken by drone indicate that the proposed system is able to: 1) achieve 95.6% mean average precision (mAP) for track components inspection at a real-time speed of 147 frames/s and 2) reach 93.6% accuracy on RSDs detection, which surpass the current state-of-the-art (SOTA) models. The exceptional inference speed and superior detection accuracy have great potential for field applications.
引用
收藏
页数:15
相关论文
共 49 条
[1]  
AlNaimi N. R., 2020, THESIS QATAR U DOHA
[2]  
[Anonymous], RAILWAY TECHNOLOGY R
[3]  
[Anonymous], FEDERAL RAILROAD ADM
[4]   LIGHTNESS, BRIGHTNESS, AND BRIGHTNESS CONTRAST .2. REFLECTANCE VARIATION [J].
AREND, LE ;
SPEHAR, B .
PERCEPTION & PSYCHOPHYSICS, 1993, 54 (04) :457-468
[5]  
Ba JL, 2016, arXiv
[6]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[7]  
Bochkovskiy A, 2020, Arxiv, DOI arXiv:2004.10934
[8]   Cascade R-CNN: Delving into High Quality Object Detection [J].
Cai, Zhaowei ;
Vasconcelos, Nuno .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6154-6162
[9]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[10]   Fault Diagnosis of High-Speed Train Bogie Based on Capsule Network [J].
Chen, Lingling ;
Qin, Na ;
Dai, Xi ;
Huang, Deqing .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2020, 69 (09) :6203-6211