Railway Track Inspection Using Deep Learning Based on Audio to Spectrogram Conversion: An on-the-Fly Approach

被引:18
作者
Hashmi, Muhammad Shadab Alam [1 ]
Ibrahim, Muhammad [2 ]
Bajwa, Imran Sarwar [2 ]
Siddiqui, Hafeez-Ur-Rehman [1 ]
Rustam, Furqan [1 ]
Lee, Ernesto [3 ]
Ashraf, Imran [4 ]
机构
[1] Khawaja Fareed Univ Engn & Informat Technol, Fac Comp Sci & Informat Technol, Rahim Yar Khan 64200, Pakistan
[2] Univ Bahawalpur, Dept Comp Sci, Bahawalpur 63100, Pakistan
[3] Broward Coll, Dept Comp Sci, Broward Cty, FL 33301 USA
[4] Yeungnam Univ, Dept Informat & Commun Engn, Gyongsan 38541, South Korea
关键词
railway track inspection; spectrograms; acoustic signals; machine learning; deep convolution neural networks; LSTM; CLASSIFICATION; MAINTENANCE; PERFORMANCE; DEFECTS; FUTURE; SCALE;
D O I
10.3390/s22051983
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
The periodic inspection of railroad tracks is very important to find structural and geometrical problems that lead to railway accidents. Currently, in Pakistan, rail tracks are inspected by an acoustic-based manual system that requires a railway engineer as a domain expert to differentiate between different rail tracks' faults, which is cumbersome, laborious, and error-prone. This study proposes the use of traditional acoustic-based systems with deep learning models to increase performance and reduce train accidents. Two convolutional neural networks (CNN) models, convolutional 1D and convolutional 2D, and one recurrent neural network (RNN) model, a long short-term memory (LSTM) model, are used in this regard. Initially, three types of faults are considered, including superelevation, wheel burnt, and normal tracks. Contrary to traditional acoustic-based systems where the spectrogram dataset is generated before the model training, the proposed approach uses on-the-fly feature extraction by generating spectrograms as a deep learning model's layer. Different lengths of audio samples are used to analyze their performance with each model. Each audio sample of 17 s is split into 3 variations of 1.7, 3.4, and 8.5 s, and all 3 deep learning models are trained and tested against each split time. Various combinations of audio data augmentation are analyzed extensively to investigate models' performance. The results suggest that the LSTM with 8.5 split time gives the best results with the accuracy of 99.7%, the precision of 99.5%, recall of 99.5%, and F1 score of 99.5%.
引用
收藏
页数:21
相关论文
共 70 条
[1]   Rail Flaw Detection Technologies for Safer, Reliable Transportation: A Review [J].
Alahakoon, Sanath ;
Sun, Yan Quan ;
Spiryagin, Maksym ;
Cole, Colin .
JOURNAL OF DYNAMIC SYSTEMS MEASUREMENT AND CONTROL-TRANSACTIONS OF THE ASME, 2018, 140 (02)
[2]  
[Anonymous], 2017, P 34 INT C MACH LEAR
[3]  
[Anonymous], 2020, Number of employees of Dollar General in the United States from 2007 to 2019
[4]   Detection of cardiac abnormality from PCG signal using LMS based least square SVM classifier [J].
Ari, Samit ;
Hembram, Koushik ;
Saha, Goutam .
EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (12) :8019-8026
[5]   A GUIDED TOUR OF FAST FOURIER TRANSFORM [J].
BERGLAND, GD .
IEEE SPECTRUM, 1969, 6 (07) :41-+
[6]  
British Broadcasting Corporation, 2019, PAK TRAIN FIR AR ACC
[7]   Rail crack recognition based on Adaptive Weighting Multi-classifier Fusion Decision [J].
Chen, Wangcai ;
Liu, Wenbo ;
Li, Kaiyu ;
Wang, Ping ;
Zhu, Haixia ;
Zhang, Yanyan ;
Hang, Cheng .
MEASUREMENT, 2018, 123 :102-114
[8]   Fault diagnosis based on deep learning for current-carrying ring of catenary system in sustainable railway transportation [J].
Chen, Yuwen ;
Song, Bin ;
Zeng, Yuan ;
Du, Xiaojiang ;
Guizani, Mohsen .
APPLIED SOFT COMPUTING, 2021, 100
[9]   nnAudio: An on-the-Fly GPU Audio to Spectrogram Conversion Toolbox Using 1D Convolutional Neural Networks [J].
Cheuk, Kin Wai ;
Anderson, Hans ;
Agres, Kat ;
Herremans, Dorien .
IEEE ACCESS, 2020, 8 :161981-162003
[10]   AN ALGORITHM FOR MACHINE CALCULATION OF COMPLEX FOURIER SERIES [J].
COOLEY, JW ;
TUKEY, JW .
MATHEMATICS OF COMPUTATION, 1965, 19 (90) :297-&