A multi-branch convolutional neural network for snoring detection based on audio

Cited by: 4
Authors
Dong, Hao [1, 2]
Wu, Haitao [2, 3]
Yang, Guan [1]
Zhang, Junming [2, 3, 4, 5]
Wan, Keqin [2]
Affiliations
[1] Zhongyuan Univ Technol, Sch Comp Sci, Zhengzhou, Henan, Peoples R China
[2] Huanghuai Univ, Sch Comp & Artificial Intelligence, Zhumadian, Henan, Peoples R China
[3] Henan Key Lab Smart Lighting, Zhumadian, Henan, Peoples R China
[4] Henan Joint Int Res Lab Behav Optimizat Control Sm, Zhumadian, Henan, Peoples R China
[5] Zhumadian Artificial Intelligence & Med Engn Tech, Zhumadian, Henan, Peoples R China
Funding
National Science Foundation (USA);
Keywords
Obstructive sleep apnea; snore detection; convolutional neural network; multi-scale features; deep learning; OBSTRUCTIVE SLEEP-APNEA;
DOI
10.1080/10255842.2024.2317438
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Obstructive sleep apnea (OSA) is associated with various health complications, and snoring is a prominent characteristic of the disorder. A concise and effective method for detecting snoring has therefore long been an important goal in sleep medicine. Because audio data are easily accessible, identifying snoring through sound analysis offers a convenient and straightforward approach. The objective of this study was to develop a convolutional neural network (CNN) that classifies snoring and non-snoring events from audio. Mel-frequency cepstral coefficients (MFCCs) were extracted as features during preprocessing of the raw data. To extract multi-scale features from the frequency domain of the sound sources, the study proposes a multi-branch convolutional neural network (MBCNN) for classification. The network uses asymmetric convolutional kernels to acquire additional information, and one-hot encoded labels were adopted to mitigate the impact of labels. The network was evaluated on a publicly available dataset of 1,000 sound samples, on which the MBCNN achieved a snoring detection accuracy of 99.5%. These results indicate that combining multi-scale features with the MBCNN, based on audio data, substantially improves snoring classification performance.
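The ideas named in the abstract (parallel branches, asymmetric kernels, one-hot labels) can be illustrated with a small sketch. It is not the authors' MBCNN: the toy "MFCC" matrix, the three hand-picked kernels, the global-average pooling step, and all function names are assumptions made for illustration, and nothing is trained. It only shows how branches with differently shaped kernels (1x3, 3x1, 2x2) look at the same input at different scales and orientations, and how a two-class label is one-hot encoded.

```python
# Illustrative sketch only: toy data, fixed kernels, no training.
# All names here are hypothetical and not taken from the paper.

def one_hot(label, num_classes=2):
    """Encode an integer class label as a one-hot list of floats."""
    vec = [0.0] * num_classes
    vec[label] = 1.0
    return vec


def conv2d_valid(x, k):
    """'Valid' 2-D cross-correlation of matrix x with kernel k (lists of lists)."""
    kh, kw = len(k), len(k[0])
    out_h = len(x) - kh + 1
    out_w = len(x[0]) - kw + 1
    return [
        [
            sum(x[i + a][j + b] * k[a][b] for a in range(kh) for b in range(kw))
            for j in range(out_w)
        ]
        for i in range(out_h)
    ]


def global_avg(feature_map):
    """Average all values of a feature map into a single number."""
    vals = [v for row in feature_map for v in row]
    return sum(vals) / len(vals)


def multi_branch_features(x, kernels):
    """Run each branch (one kernel per branch) on x and pool each result."""
    return [global_avg(conv2d_valid(x, k)) for k in kernels]


# Toy stand-in for an MFCC matrix: 4 coefficients (rows) x 5 frames (columns).
mfcc = [
    [1.0, 2.0, 3.0, 4.0, 5.0],
    [2.0, 3.0, 4.0, 5.0, 6.0],
    [3.0, 4.0, 5.0, 6.0, 7.0],
    [4.0, 5.0, 6.0, 7.0, 8.0],
]

kernels = [
    [[1.0, 0.0, -1.0]],            # 1x3 asymmetric kernel: spans frames only
    [[1.0], [0.0], [-1.0]],        # 3x1 asymmetric kernel: spans coefficients only
    [[0.25, 0.25], [0.25, 0.25]],  # 2x2 square kernel: local average
]

features = multi_branch_features(mfcc, kernels)  # one pooled value per branch
snore_label = one_hot(1)       # assumed convention: class 1 = snoring
non_snore_label = one_hot(0)   # assumed convention: class 0 = non-snoring
```

In a real network the kernels would be learned and the concatenated branch outputs would feed a classifier; here the branch outputs are simply collected in a list.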
Pages: 1243-1254
Number of pages: 12