Multi-Modal Sleep Stage Classification With Two-Stream Encoder-Decoder

被引:4
作者
Zhang, Zhao [1 ,2 ,3 ]
Lin, Bor-Shyh [4 ]
Peng, Chih-Wei [5 ,6 ]
Lin, Bor-Shing [2 ]
机构
[1] Wuyi Univ, Coll Mech & Elect Engn, Wuyishan 354300, Fujian, Peoples R China
[2] Natl Taipei Univ, Dept Comp Sci & Informat Engn, New Taipei 237303, Taiwan
[3] Natl Taipei Univ, Coll Elect Engn & Comp Sci, New Taipei 237303, Taiwan
[4] Natl Yang Ming Chiao Tung Univ, Inst Imaging & Biomed Photon, Tainan 71150, Taiwan
[5] Taipei Med Univ, Coll Biomed Engn, Sch Biomed Engn, Taipei 11031, Taiwan
[6] Taipei Med Univ, Coll Nursing, Sch Gerontol & LongTerm Care, Taipei 11031, Taiwan
关键词
Deep learning; Electrooculography; Sleep; Fuses; Convolution; Current measurement; Feature extraction; Convolutional block attention module; deep learning; multimodal; multiscale extraction; sleep-stage classification; NEURAL-NETWORK; SYSTEM;
D O I
10.1109/TNSRE.2024.3394738
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Sleep staging serves as a fundamental assessment for sleep quality measurement and sleep disorder diagnosis. Although current deep learning approaches have successfully integrated multimodal sleep signals, enhancing the accuracy of automatic sleep staging, certain challenges remain, as follows: 1) optimizing the utilization of multi-modal information complementarity, 2) effectively extracting both long- and short-range temporal features of sleep information, and 3) addressing the class imbalance problem in sleep data. To address these challenges, this paper proposes a two-stream encode-decoder network, named TSEDSleepNet, which is inspired by the depth sensitive attention and automatic multi-modal fusion (DSA2F) framework. In TSEDSleepNet, a two-stream encoder is used to extract the multiscale features of electrooculogram (EOG) and electroencephalogram (EEG) signals. And a self-attention mechanism is utilized to fuse the multiscale features, generating multi-modal saliency features. Subsequently, the coarser-scale construction module (CSCM) is adopted to extract and construct multi-resolution features from the multiscale features and the salient features. Thereafter, a Transformer module is applied to capture both long- and short-range temporal features from the multi-resolution features. Finally, the long- and short-range temporal features are restored with low-layer details and mapped to the predicted classification results. Additionally, the Lov & aacute;sz loss function is applied to alleviate the class imbalance problem in sleep datasets. Our proposed method was tested on the Sleep-EDF-39 and Sleep-EDF-153 datasets, and it achieved classification accuracies of 88.9% and 85.2% and Macro-F1 scores of 84.8% and 79.7%, respectively, thus outperforming conventional traditional baseline models. These results highlight the efficacy of the proposed method in fusing multi-modal information. This method has potential for application as an adjunct tool for diagnosing sleep disorders.
引用
收藏
页码:2096 / 2105
页数:10
相关论文
共 44 条
[1]   The Lovasz-Softmax loss: A tractable surrogate for the optimization of the intersection-over-union measure in neural networks [J].
Berman, Maxim ;
Triki, Amal Rannen ;
Blaschko, Matthew B. .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4413-4421
[2]   MultiChannelSleepNet: A Transformer-Based Model for Automatic Sleep Stage Classification With PSG [J].
Dai, Yang ;
Li, Xiuli ;
Liang, Shanshan ;
Wang, Lukang ;
Duan, Qingtian ;
Yang, Hui ;
Zhang, Chunqing ;
Chen, Xiaowei ;
Li, Longhui ;
Li, Xingyi ;
Liao, Xiang .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (09) :4204-4215
[3]  
Devlin J, 2019, Arxiv, DOI [arXiv:1810.04805, DOI 10.48550/ARXIV.1810.04805]
[4]   Mixed Neural Network Approach for Temporal Sleep Stage Classification [J].
Dong, Hao ;
Supratak, Akara ;
Pan, Wei ;
Wu, Chao ;
Matthews, Paul M. ;
Guo, Yike .
IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2018, 26 (02) :324-333
[5]   CoSleepNet: Automated sleep staging using a hybrid CNN-LSTM network on imbalanced EEG-EOG datasets [J].
Efe, Enes ;
Ozsen, Seral .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 80
[6]   Self-Supervised Learning for Label- Efficient Sleep Stage Classification: A Comprehensive Evaluation [J].
Eldele, Emadeldeen ;
Ragab, Mohamed ;
Chen, Zhenghua ;
Wu, Min ;
Kwoh, Chee-Keong ;
Li, Xiaoli .
IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2023, 31 :1333-1342
[7]   A dual-stream deep neural network integrated with adaptive boosting for sleep staging [J].
Fang, Yongkangjian ;
Xia, Yi ;
Chen, Peng ;
Zhang, Jun ;
Zhang, Yongliang .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 79
[8]   PhysioBank, PhysioToolkit, and PhysioNet - Components of a new research resource for complex physiologic signals [J].
Goldberger, AL ;
Amaral, LAN ;
Glass, L ;
Hausdorff, JM ;
Ivanov, PC ;
Mark, RG ;
Mietus, JE ;
Moody, GB ;
Peng, CK ;
Stanley, HE .
CIRCULATION, 2000, 101 (23) :E215-E220
[9]   Efficient sleep stage recognition system based on EEG signal using k-means clustering based feature weighting [J].
Gunes, Salih ;
Polat, Kemal ;
Yosunkaya, Sebnem .
EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (12) :7922-7928
[10]   A decision support system for automated identification of sleep stages from single-channel EEG signals [J].
Hassan, Ahnaf Rashik ;
Subasi, Abdulhamit .
KNOWLEDGE-BASED SYSTEMS, 2017, 128 :115-124