A Deep Learning Architecture for Temporal Sleep Stage Classification Using Multivariate and Multimodal Time Series

Cited by: 351
Authors
Chambon, Stanislas [1, 2]
Galtier, Mathieu N. [1 ]
Arnal, Pierrick J. [1 ]
Wainrib, Gilles [3 ]
Gramfort, Alexandre [4, 5, 6]
Affiliations
[1] Rythm Inc, Res & Algorithms Team, Paris, France
[2] Univ Paris Saclay, Telecom ParisTech, Lab Traitement & Commun Informat, Paris, France
[3] Ecole Normale Super, Dept Informat, DATA Team, F-75005 Paris, France
[4] Univ Paris Saclay, Telecom ParisTech, LTCI, Paris, France
[5] Univ Paris Saclay, INRIA, Paris, France
[6] Univ Paris Saclay, CEA, Paris, France
Keywords
Sleep stage classification; multivariate time series; deep learning; spatio-temporal data; transfer learning; EEG; EOG; EMG
DOI
10.1109/TNSRE.2018.2813138
Chinese Library Classification number
R318 [Biomedical engineering]
Discipline classification code
0831
Abstract
Sleep stage classification constitutes an important preliminary exam in the diagnosis of sleep disorders. It is traditionally performed by a sleep expert who assigns a sleep stage to each 30 s of signal, based on the visual inspection of signals such as electroencephalograms (EEGs), electrooculograms (EOGs), electrocardiograms, and electromyograms (EMGs). We introduce here the first deep learning approach for sleep stage classification that learns end-to-end without computing spectrograms or extracting handcrafted features, that exploits all multivariate and multimodal polysomnography (PSG) signals (EEG, EMG, and EOG), and that can exploit the temporal context of each 30-s window of data. For each modality, the first layer learns linear spatial filters that exploit the array of sensors to increase the signal-to-noise ratio, and the last layer feeds the learnt representation to a softmax classifier. Our model is compared to alternative automatic approaches based on convolutional networks or decision trees. Results obtained on 61 publicly available PSG records with up to 20 EEG channels demonstrate that our network architecture yields state-of-the-art performance. Our study reveals a number of insights on the spatiotemporal distribution of the signal of interest: a good tradeoff for optimal classification performance, measured with balanced accuracy, is to use 6 EEG channels with 2 EOG (left and right) and 3 EMG chin channels. Also, exploiting 1 min of data before and after each data segment offers the strongest improvement when a limited number of channels are available. Like sleep experts, our system exploits the multivariate and multimodal nature of PSG signals in order to deliver state-of-the-art classification performance with a small computational cost.
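Three ideas named in the abstract can be illustrated with a minimal pure-Python sketch: a linear spatial filter (each virtual channel is a weighted sum of the sensors), temporal context (each 30-s epoch is paired with its neighbours, so 1 min before and after corresponds to k = 2 epochs), and balanced accuracy (the mean of per-class recall). This is not the authors' implementation. The function names, the fixed filter weights, and the choice to skip epochs near the record boundaries are assumptions made for illustration only; in the paper the filters are learnt by the network.

```python
from collections import defaultdict


def spatial_filter(epoch, weights):
    """Apply linear spatial filters to one epoch.

    epoch:   list of C channels, each a list of T samples
    weights: list of K filters, each a list of C coefficients
             (hypothetical fixed values; a network would learn them)
    returns: list of K virtual channels, each with T samples
    """
    n_samples = len(epoch[0])
    return [
        [sum(w * ch[t] for w, ch in zip(filt, epoch)) for t in range(n_samples)]
        for filt in weights
    ]


def with_temporal_context(epochs, k):
    """Pair each epoch with its k previous and k following epochs.

    Assumption: epochs closer than k to either end of the record are
    skipped; padding would be another reasonable choice.
    returns: list of (center_index, [epoch_{i-k}, ..., epoch_{i+k}])
    """
    return [(i, epochs[i - k : i + k + 1]) for i in range(k, len(epochs) - k)]


def balanced_accuracy(y_true, y_pred):
    """Mean of per-class recall, so rare stages weigh as much as frequent ones."""
    total = defaultdict(int)
    correct = defaultdict(int)
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        if t == p:
            correct[t] += 1
    return sum(correct[c] / total[c] for c in total) / len(total)


if __name__ == "__main__":
    # Two channels, three samples; one filter that averages the two channels.
    epoch = [[1.0, 2.0, 3.0], [3.0, 2.0, 1.0]]
    print(spatial_filter(epoch, [[0.5, 0.5]]))  # [[2.0, 2.0, 2.0]]

    # Five placeholder epochs with one neighbour on each side.
    print(with_temporal_context(list("abcde"), 1))

    # Plain accuracy would be 0.75 here; balanced accuracy is 0.5.
    print(balanced_accuracy(["W", "W", "W", "N1"], ["W", "W", "W", "N2"]))
```

The last example shows why balanced accuracy is a sensible metric for sleep staging: missing the single rare-stage epoch halves the score, whereas plain accuracy would barely register it.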
Pages: 758-769
Number of pages: 12