Premature Infant Cry Classification via Deep Convolutional Recurrent Neural Network Based on Multi-class Features

被引:0
作者
R. Sabitha
P. Poonkodi
M. S. Kavitha
S. Karthik
机构
[1] SNS College of Technology,Department of Computer Science and Engineering
来源
Circuits, Systems, and Signal Processing | 2023年 / 42卷
关键词
Premature infant cry; Deep learning; Mel-frequency cepstral coefficient; Linear prediction cepstral coefficient; Bark-frequency cepstral coefficient;
D O I
暂无
中图分类号
学科分类号
摘要
The cry of a premature infant is an attempt to connect with its mother or others. The newborns are communicated in different ways depending on the reason for their screams. In recent days, the preprocessing, feature extraction, and classification of audio signals require expert attention and a lot of effort. In this paper, a novel deep convolutional recurrent neural network (DCR net) has been proposed to classify the premature infant cry signal into different categories. The acquisition of the cry signal generally requires a lengthy observation period and several activity processes to obtain all the signals of the premature infant. The relevant multi-class frequency features are extracted by using the MFCC (Mel-frequency cepstral coefficient), BFCC (Bark-frequency cepstral coefficient), and LPCC (linear prediction cepstral coefficient) features, which are combined to create a fused feature matrix that is helpful in the classification of pathological crying. Based on these features, the DCR net is used to classify sounds in the premature infant cry. The sound of the target cry signal is classified into five categories: “neh” means hunger, “heh” means discomfort, “eh” means burping, “eair” means cramps, and “owh” means fatigue. The efficiency of the DCR net was estimated with some metrics such as specificity, precision, accuracy, recall, and F1 score. The experimental fallouts disclose that the proposed DCR net attains a better classification accuracy of 97.27% for identifying infant cry signals. The DCR net increases the overall performance range by 8.61%, 11.58%, 0.54%, and 17.03% better than SVM-RBF, MFCC-SVM, optimized deep learning model, and hidden Markov model, respectively.
引用
收藏
页码:7529 / 7548
页数:19
相关论文
共 43 条
[1]  
Ashwini K(2022)A deep convolutional neural network-based approach for effective premature infant cry classification Recent Adv. Comput. Sci. Commun. 15 229-239
[2]  
Vincent PDR(2021)An efficient classification of neonates cry using extreme gradient boosting-assisted grouped-support-vector network J. Healthc. Eng. 2021 1-17
[3]  
Chang CY(2021)A review of infant cry analysis and classification EURASIP J Audio Speech Music Process. 50 35-44
[4]  
Bhattacharya S(2019)Identification of diseases in newborns using advanced acoustic features of cry signals Biomed. Signal Process. Control 154 111700-181
[5]  
Vincent PMR(2022)Deep learning systems for automatic diagnosis of infant cry signals Chaos Solitons Fractals. 45 174-14
[6]  
Lakshmanna K(2018)Automated detection and classification of basic shapes of newborn cry melody Biomed. Signal Process. Control 59 1-51993
[7]  
Srinivasan K(2022)Machine learning-based cry diagnostic system for identifying septic newborns J. Voices 1 51982-undefined
[8]  
Ji C(2020)On the use of long-term features in a newborn cry diagnostic system Biomed. Signal Process. Control 91 undefined-undefined
[9]  
Mudiyanselage TB(2018)Automatic segmentation of infant cry signals using Gaussian Mixture Models EURASIP J. Audio Speech Music Process. 7 undefined-undefined
[10]  
Gao Y(2021)Detection of ventricular fibrillation rhythm by using boosted support vector machine with an optimal variable combination Comput. Electr. Eng. 9 undefined-undefined