A multi-branch convolutional neural network for snoring detection based on audio

被引：4

作者：

Dong, Hao ^{[1
,2
]}

Wu, Haitao ^{[2
,3
]}

Yang, Guan ^{[1
]}

Zhang, Junming ^{[2
,3
,4
,5
]}

Wan, Keqin ^{[2
]}

机构：

[1] Zhongyuan Univ Technol, Sch Comp Sci, Zhengzhou, Henan, Peoples R China

[2] Huanghuai Univ, Sch Comp & Artificial Intelligence, Zhumadian, Henan, Peoples R China

[3] Henan Key Lab Smart Lighting, Zhumadian, Henan, Peoples R China

[4] Henan Joint Int Res Lab Behav Optimizat Control Sm, Zhumadian, Henan, Peoples R China

[5] Zhumadian Artificial Intelligence & Med Engn Tech, Zhumadian, Henan, Peoples R China

来源：

COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING | 2025年 / 28卷 / 08期

基金：

美国国家科学基金会;

关键词：

Obstructive sleep apnea; snore detection; convolutional neural network; multi-scale features; deep learning; OBSTRUCTIVE SLEEP-APNEA;

D O I：

10.1080/10255842.2024.2317438

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Obstructive sleep apnea (OSA) is associated with various health complications, and snoring is a prominent characteristic of this disorder. Therefore, the exploration of a concise and effective method for detecting snoring has consistently been a crucial aspect of sleep medicine. As the easily accessible data, the identification of snoring through sound analysis offers a more convenient and straightforward method. The objective of this study was to develop a convolutional neural network (CNN) for classifying snoring and non-snoring events based on audio. This study utilized Mel-frequency cepstral coefficients (MFCCs) as a method for extracting features during the preprocessing of raw data. In order to extract multi-scale features from the frequency domain of sound sources, this study proposes the utilization of a multi-branch convolutional neural network (MBCNN) for the purpose of classification. The network utilized asymmetric convolutional kernels to acquire additional information, while the adoption of one-hot encoding labels aimed to mitigate the impact of labels. The experiment tested the network's performance by utilizing a publicly available dataset consisting of 1,000 sound samples. The test results indicate that the MBCNN achieved a snoring detection accuracy of 99.5%. The integration of multi-scale features and the implementation of MBCNN, based on audio data, have demonstrated a substantial improvement in the performance of snoring classification.

引用

页码：1243 / 1254

页数：12

共 28 条

[1]

Arsenali B, 2018, IEEE ENG MED BIO, P328, DOI 10.1109/EMBC.2018.8512251

[2]

Banluesombatkul N, 2018, TENCON IEEE REGION, P2011, DOI 10.1109/TENCON.2018.8650429

[3] Excessive daytime sleepiness in young and middle-aged Chinese adults with obstructive sleep apnea: implications for cognitive dysfunction [J].

Cai, Sijie ;

Li, Zhiqiang ;

Wang, Jing ;

Wang, Qiaojun ;

Chen, Rui .

SLEEP AND BREATHING, 2024, 28 (01) :113-121

[4] Validation of snoring detection using a smartphone app [J].

Chiang, Jui-Kun ;

Lin, Yen-Chang ;

Lin, Chih-Wen ;

Ting, Ching-Shiung ;

Chiang, Yi-Ying ;

Kao, Yee-Hsin .

SLEEP AND BREATHING, 2022, 26 (01) :81-87

[5] Speech analysis for health: Current state-of-the-art and the increasing impact of deep learning [J].

Cummins, Nicholas ;

Baird, Alice ;

Schuller, Bjoern W. .

METHODS, 2018, 151 :41-54

[6] Automatic Detection of Whole Night Snoring Events Using Non-Contact Microphone [J].

Dafna, Eliran ;

Tarasiuk, Ariel ;

Zigel, Yaniv .

PLOS ONE, 2013, 8 (12)

[7] COMPARISON OF PARAMETRIC REPRESENTATIONS FOR MONOSYLLABIC WORD RECOGNITION IN CONTINUOUSLY SPOKEN SENTENCES [J].

DAVIS, SB ;

MERMELSTEIN, P .

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (04) :357-366

[8] Application of substitution box of present cipher for automated detection of snoring sounds [J].

Dogan, Sengul ;

Akbal, Erhan ;

Tuncer, Turker ;

Acharya, U. Rajendra .

ARTIFICIAL INTELLIGENCE IN MEDICINE, 2021, 117

[9]

Epstein LJ, 2009, J CLIN SLEEP MED, V5, P263

[10]

Fayek H. M., 2016, Speech Processing for Machine Learning: Filter banks, Mel-Frequency Cepstral Coefficients (MFCCs) and What's In-Between

← 1 2 3 →