Data Augmentation and Deep Learning Methods in Sound Classification: A Systematic Review

被引：41

作者：

Abayomi-Alli, Olusola O. ^{[1
]}

Damasevicius, Robertas ^{[1
]}

Qazi, Atika ^{[2
]}

Adedoyin-Olowe, Mariam ^{[3
]}

Misra, Sanjay ^{[4
]}

机构：

[1] Kaunas Univ Technol, Dept Software Engn, LT-44249 Kaunas, Lithuania

[2] Univ Brunei Darussalam, Ctr Lifelong Learning, BE-1410 Gadong, Brunei

[3] Birmingham City Univ, Sch Comp & Digital Technol, Birmingham B4 7XG, W Midlands, England

[4] Ostfold Univ Coll, Dept Comp Sci & Commun, N-1757 Halden, Norway

来源：

ELECTRONICS | 2022年 / 11卷 / 22期

关键词：

sound data; audio data; data augmentation; feature extraction; deep learning; ARTIFICIAL-INTELLIGENCE; EVENT CLASSIFICATION; FAULT-DIAGNOSIS; NEURAL-NETWORKS; RECOGNITION; SPEECH; FEATURES; AUDIO; BREATH;

D O I：

10.3390/electronics11223795

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The aim of this systematic literature review (SLR) is to identify and critically evaluate current research advancements with respect to small data and the use of data augmentation methods to increase the amount of data available for deep learning classifiers for sound (including voice, speech, and related audio signals) classification. Methodology: This SLR was carried out based on the standard SLR guidelines based on PRISMA, and three bibliographic databases were examined, namely, Web of Science, SCOPUS, and IEEE Xplore. Findings. The initial search findings using the variety of keyword combinations in the last five years (2017-2021) resulted in a total of 131 papers. To select relevant articles that are within the scope of this study, we adopted some screening exclusion criteria and snowballing (forward and backward snowballing) which resulted in 56 selected articles. Originality: Shortcomings of previous research studies include the lack of sufficient data, weakly labelled data, unbalanced datasets, noisy datasets, poor representations of sound features, and the lack of effective augmentation approach affecting the overall performance of classifiers, which we discuss in this article. Following the analysis of identified articles, we overview the sound datasets, feature extraction methods, data augmentation techniques, and its applications in different areas in the sound classification research problem. Finally, we conclude with the summary of SLR, answers to research questions, and recommendations for the sound classification task.

引用

页数：32

共 50 条

[1] Fractional-Order Calculus-Based Data Augmentation Methods for Environmental Sound Classification with Deep Learning
Yazgac, Bilgi Gorkem
Kirci, Murvet
FRACTAL AND FRACTIONAL, 2022, 6 (10)
[2] Deep Learning Methods for Heart Sounds Classification: A Systematic Review
Chen, Wei
Sun, Qiang
Chen, Xiaomin
Xie, Gangcai
Wu, Huiqun
Xu, Chen
ENTROPY, 2021, 23 (06)
[3] Explanations of Augmentation Methods for Deep Learning ECG Classification
Balasubramanian, Nikil Sharan Prabahar
Dakshit, Sagnik
ARTIFICIAL INTELLIGENCE IN MEDICINE, PT II, AIME 2024, 2024, 14845 : 277 - 287
[4] DEEP LEARNING METHODS FOR BREAST CANCER DETECTION AND CLASSIFICATION: A SYSTEMATIC REVIEW
Mousa, Tawfik ezat
Zouari, Ramzi
Baklouti, Mouna
Hamdi, Monia
Geoda, Mohamed S. M.
JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2025, 20 (01): : 209 - 234
[5] A comprehensive systematic review of deep learning methods for hyperspectral images classification
Ranjan, Pallavi
Girdhar, Ashish
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2022, 43 (17) : 6221 - 6306
[6] Effect of Data Augmentation in the Classification and Validation of Tomato Plant Disease with Deep Learning Methods
Wagle, Shivali Amit
Harikrishnan, R.
Sampe, Jahariah
Mohammad, Faseehuddin
Ali, Sawal Hamid Md
TRAITEMENT DU SIGNAL, 2021, 38 (06) : 1657 - 1670
[7] METRIC LEARNING BASED DATA AUGMENTATION FOR ENVIRONMENTAL SOUND CLASSIFICATION
Lu, Rui
Duan, Zhiyao
Zhang, Changshui
2017 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2017, : 1 - 5
[8] Deep Convolutional Neural Networks and Data Augmentation for Environmental Sound Classification
Salamon, Justin
Bello, Juan Pablo
IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (03) : 279 - 283
[9] Machine Learning and Deep Learning Methods for Skin Lesion Classification and Diagnosis: A Systematic Review
Kassem, Mohamed A.
Hosny, Khalid M.
Damasevicius, Robertas
Eltoukhy, Mohamed Meselhy
DIAGNOSTICS, 2021, 11 (08)
[10] Data Augmentation Techniques to Detect Cervical Cancer Using Deep Learning: A Systematic Review
Wubineh, Betelhem Zewdu
Rusiecki, Andrzej
Halawa, Krzysztof
SYSTEM DEPENDABILITY-THEORY AND APPLICATIONS, DEPCOS-RELCOMEX 2024, 2024, 1026 : 325 - 336

← 1 2 3 4 5 →