Evaluating Noise-Robustness of Convolutional and Recurrent Neural Networks for Baby Cry Recognition

被引:0
|
作者
Renanti, Medhanita Dewi [1 ,2 ]
Buono, Agus [3 ]
Priandana, Karlisa [3 ]
Wijaya, sony Hartono [3 ]
机构
[1] IPB Univ, Doctoral Study Program Comp Dept, Bogor, Indonesia
[2] IPB Univ, Coll Vocat Studies, Software Engn Technol, Bogor, Indonesia
[3] IPB Univ, Dept Comp, Bogor, Indonesia
关键词
Baby cry recognition; deep learning; gated recurrent unit; long short-term memory; noise robustness; signal- to-noise ratio;
D O I
10.14569/IJACSA.2024.0150660
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Reliable baby cry recognition plays a crucial role in infant care and monitoring, yet real-world environment poses challenges to system accuracy due to its background noises. This study proposes a novel CNN architecture for baby cry recognition under varying noise conditions, featuring three convolutional layers, a max pooling layer, and 0.5 dropout set, and compares its performance against standard RNN models. The models were trained for 100 epochs with a batch size of 64 and evaluated in both clean and noisy environments. To simulate real-world scenarios, recordings were transformed into audio signals and subjected to varying levels of background noise, particularly at different signal-to-noise ratios (SNRs). Results indicate that both models achieved high accuracy (>89%) in noise-free conditions. However, the proposed CNN maintained higher precision (93%) and overall accuracy (91%) than the RNN under 10dB noise, demonstrating its superior noise robustness for baby cry recognition. This improvement is attri buted to the CNN's capacity to capture spatial features in audio signals, making it susceptible to noise disruptions. These findings contribute to the development of more reliable and robust baby cry recognition systems.
引用
收藏
页码:585 / 593
页数:9
相关论文
共 50 条
  • [1] Evaluating Convolutional Neural Networks and Vision Transformers for Baby Cry Sound Analysis
    Younis, Samir A.
    Sobhy, Dalia
    Tawfik, Noha S.
    FUTURE INTERNET, 2024, 16 (07)
  • [2] Baby Cry Recognition Using Deep Neural Networks
    Yong, Boon Fei
    Ting, Hua Nong
    Ng, Kwan Hoong
    WORLD CONGRESS ON MEDICAL PHYSICS AND BIOMEDICAL ENGINEERING 2018, VOL 3, 2019, 68 (03): : 809 - 813
  • [3] Robustness of convolutional neural networks to physiological electrocardiogram noise
    Venton, J.
    Harris, P. M.
    Sundar, A.
    Smith, N. A. S.
    Aston, P. J.
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2021, 379 (2212):
  • [4] Robustness of Deep Convolutional Neural Networks for Image Recognition
    Ulicny, Matej
    Lundstrom, Jens
    Byttner, Stefan
    INTELLIGENT COMPUTING SYSTEMS, 2016, 597 : 16 - 30
  • [5] CONVOLUTIONAL NEURAL NETWORKS FOR NOISE SIGNAL RECOGNITION
    Portsev, Ruslan J.
    Makarenko, Andrey V.
    2018 IEEE 28TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2018,
  • [6] Robustness of convolutional neural networks in recognition of pigmented skin lesions
    Maron, Roman C.
    Haggenmueller, Sarah
    von Kalle, Christof
    Utikal, Jochen S.
    Meier, Friedegund
    Gellrich, Frank F.
    Hauschild, Axel
    French, Lars E.
    Schlaak, Max
    Ghoreschi, Kamran
    Kutzner, Heinz
    Heppt, Markus V.
    Haferkamp, Sebastian
    Sondermann, Wiebke
    Schadendorf, Dirk
    Schilling, Bastian
    Hekler, Achim
    Krieghoff-Henning, Eva
    Kather, Jakob N.
    Froehling, Stefan
    Lipka, Daniel B.
    Brinker, Titus J.
    EUROPEAN JOURNAL OF CANCER, 2021, 145 : 81 - 91
  • [7] On the Efficacy and Noise-Robustness of Jointly Learned Speech Emotion and Automatic Speech Recognition
    Bansal, Lokesh
    Dubagunta, S. Pavankumar
    Chetlur, Malolan
    Jagtap, Pushpak
    Ganapathiraju, Aravind
    INTERSPEECH 2023, 2023, : 1863 - 1867
  • [8] IMPROVING CONVOLUTIONAL RECURRENT NEURAL NETWORKS FOR SPEECH EMOTION RECOGNITION
    Meyer, Patrick
    Xu, Ziyi
    Fingscheidt, Tim
    2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 365 - 372
  • [9] Noise Immunity and Robustness Study of Image Recognition Using a Convolutional Neural Network
    Ziyadinov, Vadim
    Tereshonok, Maxim
    SENSORS, 2022, 22 (03)
  • [10] Gated Convolutional Recurrent Neural Networks for Multilingual Handwriting Recognition
    Bluche, Theodore
    Messina, Ronaldo
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 646 - 651