Baby Cry Recognition Using Deep Neural Networks

被引:3
|
作者
Yong, Boon Fei [1 ]
Ting, Hua Nong [1 ]
Ng, Kwan Hoong [2 ]
机构
[1] Univ Malaya, Biomed Engn Dept, Fac Engn, Kuala Lumpur, Malaysia
[2] Univ Malaya, Dept Biomed Imaging, Kuala Lumpur, Malaysia
来源
WORLD CONGRESS ON MEDICAL PHYSICS AND BIOMEDICAL ENGINEERING 2018, VOL 3 | 2019年 / 68卷 / 03期
关键词
Infant cry recognition; Restricted boltzmann machine; Convolution neural networks;
D O I
10.1007/978-981-10-9023-3_147
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Infant cry recognition is a challenging task as it is hard to determine the speech features that can allow researchers to clearly separate between different types of cries. However, baby cry is treated as a different way of communication of speech. The types of baby cry can be differentiated using Mel-Frequency Cepstral Coefficient (MFCC) with appropriate artificial intelligence model. Stacked restricted Boltzmann machine (RBN) is popular in providing few layers of neural networks to convert the high dimensional data to lower dimensional data to fine tune the input data to a better initialized weight for the neural networks. Usually RBN is used with another deep neural network to form the deep belief networks (DBN), and the studies in this direction is heading towards the convolutional-RBN variant. The study on RBN to pre-train Convolutional neural networks (CNN) without convolution function in the RBN meanwhile is scarce due to the Back propagation and principal component analysis can be applied directly to the CNN. In this paper, we describe the hybrid system between RBN and CNN for learning class specific features for baby cry recognition using the feature of Mel-Frequency Cepstral Coefficient. We archived an 78.6% of accuracy on 5 types of baby cries by validating the proposed model on baby cry recognition.
引用
收藏
页码:809 / 813
页数:5
相关论文
共 50 条
  • [1] Evaluating Noise-Robustness of Convolutional and Recurrent Neural Networks for Baby Cry Recognition
    Renanti, Medhanita Dewi
    Buono, Agus
    Priandana, Karlisa
    Wijaya, sony Hartono
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (06) : 585 - 593
  • [2] Baby Cry Recognition by BCRNet Using Transfer Learning and Deep Feature Fusion
    Zhang, Ke
    Ting, Hua-Nong
    Choo, Yao-Mun
    IEEE ACCESS, 2023, 11 : 126251 - 126262
  • [3] Monument Recognition using Deep Neural Networks
    Gada, Siddhant
    Mehta, Viraj
    Kanchan, Karan
    Jain, Chahat
    Raut, Purva
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (ICCIC), 2017, : 645 - 650
  • [4] Face Recognition using Deep Neural Networks
    Dastgiri, Amirhosein
    Jafarinamin, Pouria
    Kamarbaste, Sami
    Gholizade, Mahdi
    JOURNAL OF MECHANICS OF CONTINUA AND MATHEMATICAL SCIENCES, 2019, 14 (03): : 510 - 527
  • [5] Face recognition by myopic baby neural networks
    Valentin, D
    Abdi, H
    INFANT AND CHILD DEVELOPMENT, 2001, 10 (1-2) : 19 - 20
  • [6] Barley disease recognition using deep neural networks
    Rezaei, Masoud
    Gupta, Sanjiv
    Diepeveen, Dean
    Laga, Hamid
    Jones, Michael G. K.
    Sohel, Ferdous
    EUROPEAN JOURNAL OF AGRONOMY, 2024, 161
  • [7] Visual Emotion Recognition Using Deep Neural Networks
    Iliev, Alexander I.
    Mote, Ameya
    DIGITAL PRESENTATION AND PRESERVATION OF CULTURAL AND SCIENTIFIC HERITAGE, 2022, 12 : 77 - 88
  • [8] RECOGNITION OF ACOUSTIC EVENTS USING DEEP NEURAL NETWORKS
    Gencoglu, Oguzhan
    Virtanen, Tuomas
    Huttunen, Heikki
    2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 506 - 510
  • [9] Emotional Speech Recognition Using Deep Neural Networks
    Trinh Van, Loan
    Dao Thi Le, Thuy
    Le Xuan, Thanh
    Castelli, Eric
    SENSORS, 2022, 22 (04)
  • [10] Modulation Recognition Using Hierarchical Deep Neural Networks
    Karra, Krishna
    Kuzdeba, Scott
    Petersen, Josh
    2017 IEEE INTERNATIONAL SYMPOSIUM ON DYNAMIC SPECTRUM ACCESS NETWORKS (IEEE DYSPAN), 2017,