Data Augmentation and Deep Learning Methods in Sound Classification: A Systematic Review

Cited by: 41
Authors
Abayomi-Alli, Olusola O. [1]
Damasevicius, Robertas [1]
Qazi, Atika [2]
Adedoyin-Olowe, Mariam [3]
Misra, Sanjay [4]
Affiliations
[1] Kaunas Univ Technol, Dept Software Engn, LT-44249 Kaunas, Lithuania
[2] Univ Brunei Darussalam, Ctr Lifelong Learning, BE-1410 Gadong, Brunei
[3] Birmingham City Univ, Sch Comp & Digital Technol, Birmingham B4 7XG, W Midlands, England
[4] Ostfold Univ Coll, Dept Comp Sci & Commun, N-1757 Halden, Norway
Keywords
sound data; audio data; data augmentation; feature extraction; deep learning; ARTIFICIAL-INTELLIGENCE; EVENT CLASSIFICATION; FAULT-DIAGNOSIS; NEURAL-NETWORKS; RECOGNITION; SPEECH; FEATURES; AUDIO; BREATH
DOI
10.3390/electronics11223795
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The aim of this systematic literature review (SLR) is to identify and critically evaluate current research advancements with respect to small data and the use of data augmentation methods to increase the amount of data available to deep learning classifiers for sound classification (including voice, speech, and related audio signals). Methodology: This SLR was carried out following standard SLR guidelines based on PRISMA, and three bibliographic databases were examined, namely Web of Science, SCOPUS, and IEEE Xplore. Findings: The initial search, using a variety of keyword combinations and covering the last five years (2017-2021), returned a total of 131 papers. To select relevant articles within the scope of this study, we applied screening exclusion criteria and snowballing (forward and backward), which resulted in 56 selected articles. Originality: Shortcomings of previous research studies include the lack of sufficient data, weakly labelled data, unbalanced datasets, noisy datasets, poor representations of sound features, and the lack of an effective augmentation approach, all of which affect the overall performance of classifiers and are discussed in this article. Following the analysis of the identified articles, we give an overview of the sound datasets, feature extraction methods, data augmentation techniques, and their applications in different areas of the sound classification research problem. Finally, we conclude with a summary of the SLR, answers to the research questions, and recommendations for the sound classification task.
Pages: 32
Related papers
50 in total
  • [1] Fractional-Order Calculus-Based Data Augmentation Methods for Environmental Sound Classification with Deep Learning
    Yazgac, Bilgi Gorkem
    Kirci, Murvet
    FRACTAL AND FRACTIONAL, 2022, 6 (10)
  • [2] Deep Learning Methods for Heart Sounds Classification: A Systematic Review
    Chen, Wei
    Sun, Qiang
    Chen, Xiaomin
    Xie, Gangcai
    Wu, Huiqun
    Xu, Chen
    ENTROPY, 2021, 23 (06)
  • [3] Explanations of Augmentation Methods for Deep Learning ECG Classification
    Balasubramanian, Nikil Sharan Prabahar
    Dakshit, Sagnik
    ARTIFICIAL INTELLIGENCE IN MEDICINE, PT II, AIME 2024, 2024, 14845 : 277 - 287
  • [4] DEEP LEARNING METHODS FOR BREAST CANCER DETECTION AND CLASSIFICATION: A SYSTEMATIC REVIEW
    Mousa, Tawfik ezat
    Zouari, Ramzi
    Baklouti, Mouna
    Hamdi, Monia
    Geoda, Mohamed S. M.
    JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2025, 20 (01) : 209 - 234
  • [5] A comprehensive systematic review of deep learning methods for hyperspectral images classification
    Ranjan, Pallavi
    Girdhar, Ashish
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2022, 43 (17) : 6221 - 6306
  • [6] Effect of Data Augmentation in the Classification and Validation of Tomato Plant Disease with Deep Learning Methods
    Wagle, Shivali Amit
    Harikrishnan, R.
    Sampe, Jahariah
    Mohammad, Faseehuddin
    Ali, Sawal Hamid Md
    TRAITEMENT DU SIGNAL, 2021, 38 (06) : 1657 - 1670
  • [7] METRIC LEARNING BASED DATA AUGMENTATION FOR ENVIRONMENTAL SOUND CLASSIFICATION
    Lu, Rui
    Duan, Zhiyao
    Zhang, Changshui
    2017 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2017 : 1 - 5
  • [8] Deep Convolutional Neural Networks and Data Augmentation for Environmental Sound Classification
    Salamon, Justin
    Bello, Juan Pablo
    IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (03) : 279 - 283
  • [9] Machine Learning and Deep Learning Methods for Skin Lesion Classification and Diagnosis: A Systematic Review
    Kassem, Mohamed A.
    Hosny, Khalid M.
    Damasevicius, Robertas
    Eltoukhy, Mohamed Meselhy
    DIAGNOSTICS, 2021, 11 (08)
  • [10] Data Augmentation Techniques to Detect Cervical Cancer Using Deep Learning: A Systematic Review
    Wubineh, Betelhem Zewdu
    Rusiecki, Andrzej
    Halawa, Krzysztof
    SYSTEM DEPENDABILITY-THEORY AND APPLICATIONS, DEPCOS-RELCOMEX 2024, 2024, 1026 : 325 - 336