Evaluating Noise-Robustness of Convolutional and Recurrent Neural Networks for Baby Cry Recognition

被引:0
作者
Renanti, Medhanita Dewi [1 ,2 ]
Buono, Agus [3 ]
Priandana, Karlisa [3 ]
Wijaya, sony Hartono [3 ]
机构
[1] IPB Univ, Doctoral Study Program Comp Dept, Bogor, Indonesia
[2] IPB Univ, Coll Vocat Studies, Software Engn Technol, Bogor, Indonesia
[3] IPB Univ, Dept Comp, Bogor, Indonesia
关键词
Baby cry recognition; deep learning; gated recurrent unit; long short-term memory; noise robustness; signal- to-noise ratio;
D O I
10.14569/IJACSA.2024.0150660
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Reliable baby cry recognition plays a crucial role in infant care and monitoring, yet real-world environment poses challenges to system accuracy due to its background noises. This study proposes a novel CNN architecture for baby cry recognition under varying noise conditions, featuring three convolutional layers, a max pooling layer, and 0.5 dropout set, and compares its performance against standard RNN models. The models were trained for 100 epochs with a batch size of 64 and evaluated in both clean and noisy environments. To simulate real-world scenarios, recordings were transformed into audio signals and subjected to varying levels of background noise, particularly at different signal-to-noise ratios (SNRs). Results indicate that both models achieved high accuracy (>89%) in noise-free conditions. However, the proposed CNN maintained higher precision (93%) and overall accuracy (91%) than the RNN under 10dB noise, demonstrating its superior noise robustness for baby cry recognition. This improvement is attri buted to the CNN's capacity to capture spatial features in audio signals, making it susceptible to noise disruptions. These findings contribute to the development of more reliable and robust baby cry recognition systems.
引用
收藏
页码:585 / 593
页数:9
相关论文
共 50 条
[31]   Convolutional neural networks for ship type recognition [J].
Rainey, Katie ;
Reeder, John D. ;
Corelli, Alexander G. .
AUTOMATIC TARGET RECOGNITION XXVI, 2016, 9844
[32]   TOWARD AIRCRAFT RECOGNITION WITH CONVOLUTIONAL NEURAL NETWORKS [J].
Mash, Robert ;
Becherer, Nicholas ;
Woolley, Brian ;
Pecarina, John .
PROCEEDINGS OF THE 2016 IEEE NATIONAL AEROSPACE AND ELECTRONICS CONFERENCE (NAECON) AND OHIO INNOVATION SUMMIT (OIS), 2016, :225-232
[33]   An Evaluation of Convolutional Neural Networks on Material Recognition [J].
Shang, Xiaowei ;
Xu, Ying ;
Qi, Lin ;
Madessa, Amanuel Hirpa ;
Dong, Junyu .
2017 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTED, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI), 2017,
[34]   Human Activity Recognition with Convolutional Neural Networks [J].
Bevilacqua, Antonio ;
MacDonald, Kyle ;
Rangarej, Aamina ;
Widjaya, Venessa ;
Caulfield, Brian ;
Kechadi, Tahar .
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2018, PT III, 2019, 11053 :541-552
[35]   Driving Posture Recognition by Convolutional Neural Networks [J].
Yan, Chao ;
Zhang, Bailing ;
Coenen, Frans .
2015 11TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION (ICNC), 2015, :680-685
[36]   Convolutional Neural Networks for the Recognition of Malayalam Characters [J].
Anil, R. ;
Manjusha, K. ;
Kumar, S. Sachin ;
Soman, K. P. .
PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON FRONTIERS OF INTELLIGENT COMPUTING: THEORY AND APPLICATIONS (FICTA) 2014, VOL 2, 2015, 328 :493-500
[37]   Recognition of flowers using convolutional neural networks [J].
Alkhonin, Abdulrahman ;
Almutairi, Abdulelah ;
Alburaidi, Abdulmajeed ;
Saudagar, Abdul Khader Jilani .
INTERNATIONAL JOURNAL OF INTELLIGENT ENGINEERING INFORMATICS, 2020, 8 (03) :186-197
[38]   Improved Sensor Based Human Activity Recognition via Hybrid Convolutional and Recurrent Neural Networks [J].
Perez-Gamboa, Sonia ;
Sun, Qingquan ;
Zhang, Yan .
2021 8TH IEEE INTERNATIONAL SYMPOSIUM ON INERTIAL SENSORS AND SYSTEMS (INERTIAL 2021), 2021,
[39]   A Hybrid Model for Soybean Yield Prediction Integrating Convolutional Neural Networks, Recurrent Neural Networks, and Graph Convolutional Networks [J].
Ingole, Vikram S. ;
Kshirsagar, Ujwala A. ;
Singh, Vikash ;
Yadav, Manish Varun ;
Krishna, Bipin ;
Kumar, Roshan .
COMPUTATION, 2025, 13 (01)
[40]   Emotion Recognition from Speech using Artificial Neural Networks and. Recurrent Neural Networks [J].
Sharma, Shambhavi .
2021 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2021), 2021, :153-158