Large-Scale Automated Sleep Staging

被引:79
作者
Sun, Haoqi [1 ,2 ]
Jia, Jian [3 ]
Goparaju, Balaji [4 ]
Huang, Guang-Bin [5 ]
Sourina, Olga [2 ]
Bianchi, Matt Travis [4 ]
Westover, M. Brandon [4 ]
机构
[1] Nanyang Technol Univ, Energy Res Inst NTU, Interdisciplinary Grad Sch, Singapore 639798, Singapore
[2] Nanyang Technol Univ, Fraunhofer IDM NTU, Xian 710127, Shaanxi, Peoples R China
[3] Northwest Univ, Sch Math, Xian 710127, Shaanxi, Peoples R China
[4] Massachusetts Gen Hosp, Dept Neurol, Boston, MA 02114 USA
[5] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
基金
新加坡国家研究基金会;
关键词
sleep stages; EEG; machine learning; big data; EXTREME LEARNING-MACHINE; CLASSIFICATION; VALIDATION; EEG; RECHTSCHAFFEN; PERFORMANCE; MEDICINE; SYSTEM; KALES;
D O I
10.1093/sleep/zsx139
中图分类号
R74 [神经病学与精神病学];
学科分类号
摘要
Study Objectives: Automated sleep staging has been previously limited by a combination of clinical and physiological heterogeneity. Both factors are in principle addressable with large data sets that enable robust calibration. However, the impact of sample size remains uncertain. The objectives are to investigate the extent to which machine learning methods can approximate the performance of human scorers when supplied with sufficient training cases and to investigate how staging performance depends on the number of training patients, contextual information, model complexity, and imbalance between sleep stage proportions. Methods: A total of 102 features were extracted from six electroencephalography (EEG) channels in routine polysomnography. Two thousand nights were partitioned into equal (n = 1000) training and testing sets for validation. We used epoch-by-epoch Cohen's kappa statistics to measure the agreement between classifier output and human scorer according to American Academy of Sleep Medicine scoring criteria. Results: Epoch-by-epoch Cohen's kappa improved with increasing training EEG recordings until saturation occurred (n = similar to 300). The kappa value was further improved by accounting for contextual (temporal) information, increasing model complexity, and adjusting the model training procedure to account for the imbalance of stage proportions. The final kappa on the testing set was 0.68. Testing on more EEG recordings leads to kappa estimates with lower variance. Conclusion: Training with a large data set enables automated sleep staging that compares favorably with human scorers. Because testing was performed on a large and heterogeneous data set, the performance estimate has low variance and is likely to generalize broadly.
引用
收藏
页数:12
相关论文
共 33 条
[1]   An E-health solution for automatic sleep classification according to Rechtschaffen and Kales:: Validation study of the Somnolyzer 24 x 7 utilizing the Siesta database [J].
Anderer, P ;
Gruber, G ;
Parapatics, S ;
Woertz, M ;
Miazhynskaia, T ;
Klösch, G ;
Saletu, B ;
Zeitlhofer, J ;
Barbanoj, MJ ;
Danker-Hopfe, H ;
Himanen, SL ;
Kemp, B ;
Penzel, T ;
Grözinger, M ;
Kunz, D ;
Rappelsberger, P ;
Schlögl, A ;
Dorffner, G .
NEUROPSYCHOBIOLOGY, 2005, 51 (03) :115-133
[2]   Computer-Assisted Sleep Classification according to the Standard of the American Academy of Sleep Medicine: Validation Study of the AASM Version of the Somnolyzer 24 x 7 [J].
Anderer, Peter ;
Moreau, Arnaud ;
Woertz, Michael ;
Ross, Marco ;
Gruber, Georg ;
Parapatics, Silvia ;
Loretz, Erna ;
Heller, Esther ;
Schmidt, Andrea ;
Boeck, Marion ;
Moser, Doris ;
Kloesch, Gerhard ;
Saletu, Bernd ;
Saletu-Zyhlarz, Gerda M. ;
Danker-Hopfe, Heidi ;
Zeitlhofer, Josef ;
Dorffner, Georg .
NEUROPSYCHOBIOLOGY, 2010, 62 (04) :250-264
[3]   A Review of Multitaper Spectral Analysis [J].
Babadi, Behtash ;
Brown, Emery N. .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2014, 61 (05) :1555-1564
[4]   Automatic analysis of single-channel sleep EEG:: Validation in healthy individuals [J].
Berthomier, Christian ;
Drouot, Xavier ;
Herman-Stoieca, Maria ;
Berthomier, Pierre ;
Prado, Jacques ;
Bokar-Thire, Djibril ;
Benoit, Odile ;
Mattout, Jeremie ;
d'Ortho, Marie-Pia .
SLEEP, 2007, 30 (11) :1587-1595
[5]  
Bishop C., 2006, Pattern recognition and machine learning, P423
[6]   Interrater reliability for sleep scoring according to the Rechtschaffen & Kales and the new AASM standard [J].
Danker-Hopfe, Heidi ;
Anderer, Peter ;
Zeitlhofer, Josef ;
Boeck, Marion ;
Dorn, Hans ;
Gruber, Georg ;
Heller, Esther ;
Loretz, Erna ;
Moser, Doris ;
Parapatics, Silvia ;
Saletu, Bernd ;
Schmidt, Andrea ;
Dorffner, Georg .
JOURNAL OF SLEEP RESEARCH, 2009, 18 (01) :74-84
[7]   Scaling Up Scientific Discovery in Sleep Medicine: The National Sleep Research Resource [J].
Dean, Dennis A., II ;
Goldberger, Ary L. ;
Mueller, Remo ;
Kim, Matthew ;
Rueschman, Michael ;
Mobley, Daniel ;
Sahoo, Satya S. ;
Jayapandian, Catherine P. ;
Cui, Licong ;
Morrical, Michael G. ;
Surovec, Susan ;
Zhang, Guo-Qiang ;
Redline, Susan .
SLEEP, 2016, 39 (05) :1151-1164
[8]  
Durbin R., 1998, BIOL SEQUENCE ANAL P
[9]  
Esteller R, 2001, P ENG MED BIOL SOC 2
[10]   Classification of Sleep Stages Using Multi-wavelet Time Frequency Entropy and LDA [J].
Fraiwan, L. ;
Lweesy, K. ;
Khasawneh, N. ;
Fraiwan, M. ;
Wenz, H. ;
Dickhaus, H. .
METHODS OF INFORMATION IN MEDICINE, 2010, 49 (03) :230-237