A multi-branch convolutional neural network for snoring detection based on audio

被引:4
作者
Dong, Hao [1 ,2 ]
Wu, Haitao [2 ,3 ]
Yang, Guan [1 ]
Zhang, Junming [2 ,3 ,4 ,5 ]
Wan, Keqin [2 ]
机构
[1] Zhongyuan Univ Technol, Sch Comp Sci, Zhengzhou, Henan, Peoples R China
[2] Huanghuai Univ, Sch Comp & Artificial Intelligence, Zhumadian, Henan, Peoples R China
[3] Henan Key Lab Smart Lighting, Zhumadian, Henan, Peoples R China
[4] Henan Joint Int Res Lab Behav Optimizat Control Sm, Zhumadian, Henan, Peoples R China
[5] Zhumadian Artificial Intelligence & Med Engn Tech, Zhumadian, Henan, Peoples R China
基金
美国国家科学基金会;
关键词
Obstructive sleep apnea; snore detection; convolutional neural network; multi-scale features; deep learning; OBSTRUCTIVE SLEEP-APNEA;
D O I
10.1080/10255842.2024.2317438
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Obstructive sleep apnea (OSA) is associated with various health complications, and snoring is a prominent characteristic of this disorder. Therefore, the exploration of a concise and effective method for detecting snoring has consistently been a crucial aspect of sleep medicine. As the easily accessible data, the identification of snoring through sound analysis offers a more convenient and straightforward method. The objective of this study was to develop a convolutional neural network (CNN) for classifying snoring and non-snoring events based on audio. This study utilized Mel-frequency cepstral coefficients (MFCCs) as a method for extracting features during the preprocessing of raw data. In order to extract multi-scale features from the frequency domain of sound sources, this study proposes the utilization of a multi-branch convolutional neural network (MBCNN) for the purpose of classification. The network utilized asymmetric convolutional kernels to acquire additional information, while the adoption of one-hot encoding labels aimed to mitigate the impact of labels. The experiment tested the network's performance by utilizing a publicly available dataset consisting of 1,000 sound samples. The test results indicate that the MBCNN achieved a snoring detection accuracy of 99.5%. The integration of multi-scale features and the implementation of MBCNN, based on audio data, have demonstrated a substantial improvement in the performance of snoring classification.
引用
收藏
页码:1243 / 1254
页数:12
相关论文
共 28 条
[11]   A Novel Snore Detection and Suppression Method for a Flexible Patch With MEMS Microphone and Accelerometer [J].
He, Chunhua ;
Tan, Jiewen ;
Jian, Xuelei ;
Zhong, Guangxiong ;
Wu, Heng ;
Cheng, Lianglun ;
Lin, Juze .
IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (24) :25791-25804
[12]   Speaker state classification based on fusion of asymmetric simple partial least squares (SIMPLS) and support vector machines [J].
Huang, Dong-Yan ;
Zhang, Zhengchen ;
Ge, Shuzhi Sam .
COMPUTER SPEECH AND LANGUAGE, 2014, 28 (02) :392-419
[13]   Automatic snoring sounds detection from sleep sounds based on deep learning [J].
Jiang, Yanmei ;
Peng, Jianxin ;
Zhang, Xiaowen .
PHYSICAL AND ENGINEERING SCIENCES IN MEDICINE, 2020, 43 (02) :679-689
[15]   Obstructive sleep apnea and risk of stroke: A meta-analysis of prospective studies [J].
Li, Min ;
Hou, Wen-Shang ;
Zhang, Xiao-Wei ;
Tang, Zhen-Yu .
INTERNATIONAL JOURNAL OF CARDIOLOGY, 2014, 172 (02) :466-469
[16]   Automatic snoring detection using a hybrid 1D-2D convolutional neural network [J].
Li, Ruixue ;
Li, Wenjun ;
Yue, Keqiang ;
Zhang, Rulin ;
Li, Yilin .
SCIENTIFIC REPORTS, 2023, 13 (01)
[17]   SOME EPIDEMIOLOGICAL DATA ON SNORING AND CARDIOCIRCULATORY DISTURBANCES [J].
LUGARESI, E ;
CIRIGNOTTA, F ;
COCCAGNA, G ;
PIANA, C .
SLEEP, 1980, 3 (3-4) :221-224
[18]  
Maimon N, 2010, J CLIN SLEEP MED, V6, P475
[19]  
Pons Jordi., 2016, Content-BasedMultimedia Indexing (CBMI), 201614th International Workshopon, P1, DOI 10.1109/CBMI.2016.7500246
[20]  
Rabiner L.R., 1978, IEEE Transactions on Systems, Man, and Cybernetics, V8, P146, DOI [DOI 10.1109/TSMC.1978.4309918, 10.1109/TSMC.1978.4309918]