Data augmentation on convolutional neural networks to classify mechanical noise

被引:17
作者
Abeysinghe, Asith [1 ]
Tohmuang, Sitthichart [1 ]
Davy, John Laurence [2 ]
Fard, Mohammad [1 ]
机构
[1] RMIT Univ, Sch Engn, Melbourne, Australia
[2] RMIT Univ, Sch Sci, Melbourne, Australia
关键词
Data augmentation; Squeak and rattle; Convolutional neural networks; Mel-frequency cepstral coefficients; Sound pattern recognition; ARTIFICIAL-INTELLIGENCE;
D O I
10.1016/j.apacoust.2023.109209
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Mechanical noise identification and classification are essential for automotive and machinery fault diag-nosis. The scarcity of labelled audio data for noise-related mechanical issues has constrained the utilisa-tion of complex, high-capacity machine learning models. In this research, the application of augmentation methods for labelled squeak and rattle datasets has proven that the accuracy of deep con-volutional neural networks can be improved. Data augmentation has eliminated common machine learn-ing issues such as overfitting observed in the models trained from a small dataset. The influence of different augmentation methods for the dataset was evaluated and compared based on classification accuracy. Different data augmentation methods have been tested to classify audio classes of different mechanical noises. The use of class-specific data augmentation leads to the development of more accu-rate machine learning models. Different combinations of data augmentations were investigated. This research showed that the new combined augmentation process could significantly improve the classi-fiers' accuracy. The proposed combined data augmentation technique achieved the highest precision with the lowest error rate for both squeak and rattle datasets. The proposed method can be applied to any type of mechanical noise identification and classification.(c) 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页数:10
相关论文
共 33 条
[1]   Mel frequency cepstral coefficient temporal feature integration for classifying squeak and rattle noisea) [J].
Abeysinghe, Asith ;
Fard, Mohammad ;
Jazar, Reza ;
Zambetta, Fabio ;
Davy, John .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2021, 150 (01) :193-201
[2]   Deep-Learning-Based Approach to Anomaly Detection Techniques for Large Acoustic Data in Machine Operation [J].
Ahn, Hyojung ;
Yeo, Inchoon .
SENSORS, 2021, 21 (16)
[3]   Deep-Net: A Lightweight CNN-Based Speech Emotion Recognition System Using Deep Frequency Features [J].
Anvarjon, Tursunov ;
Mustaqeem ;
Kwon, Soonil .
SENSORS, 2020, 20 (18) :1-16
[4]   Computational neural network in melanocytic lesions diagnosis: artificial intelligence to improve diagnosis in dermatology? [J].
Aractingi, Selim ;
Pellacani, Giovanni .
EUROPEAN JOURNAL OF DERMATOLOGY, 2019, 29 (Suppl 1) :4-7
[5]   Data Augmentation for Deep Neural Network Acoustic Modeling [J].
Cui, Xiaodong ;
Goel, Vaibhava ;
Kingsbury, Brian .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (09) :1469-1477
[6]   Audio Time Stretching Using Fuzzy Classification of Spectral Bins [J].
Damskagg, Eero-Pekka ;
Valimaki, Vesa .
APPLIED SCIENCES-BASEL, 2017, 7 (12)
[7]  
Dhakal P, 2018, IEEE INT SYMP SIGNAL, P78, DOI 10.1109/ISSPIT.2018.8642755
[8]  
Esa M. F. M., 2019, IOP Conference Series: Materials Science and Engineering, V551, DOI 10.1088/1757-899X/551/1/012066
[9]  
Hossan MA., 2010, 2010 4 INT C SIGN PR, P1, DOI DOI 10.1109/ICSPCS.2010.5709752
[10]   Time Series Data Augmentation for Neural Networks by Time Warping with a Discriminative Teacher [J].
Iwana, Brian Kenji ;
Uchida, Seiichi .
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, :3558-3565