Automated sleep stage scoring of the Sleep Heart Health Study using deep neural networks

被引:72
作者
Zhang, Linda [1 ]
Fabbri, Daniel [1 ]
Upender, Raghu [2 ]
Kent, David [3 ]
机构
[1] Vanderbilt Univ, Med Ctr, Dept Biomed Informat, Nashville, TN USA
[2] Vanderbilt Univ, Sch Med, Dept Neurol, Sleep Disorders Div, Nashville, TN 37212 USA
[3] Vanderbilt Univ, Med Ctr, Dept Otolaryngol, 1215 21st Ave South,Suite 7209, Nashville, TN 37232 USA
基金
美国国家卫生研究院;
关键词
polysomnography; sleep staging; deep learning; machine learning; INTERRATER RELIABILITY; AMERICAN ACADEMY; CLASSIFICATION; EEG; SYSTEM; VARIABILITY; VALIDATION; MEDICINE; RECHTSCHAFFEN; PERFORMANCE;
D O I
10.1093/sleep/zsz159
中图分类号
R74 [神经病学与精神病学];
学科分类号
摘要
Study Objectives: Polysomnography (PSG) scoring is labor intensive and suffers from variability in inter- and intra-rater reliability. Automated PSG scoring has the potential to reduce the human labor costs and the variability inherent to this task. Deep learning is a form of machine learning that uses neural networks to recognize data patterns by inspecting many examples rather than by following explicit programming. Methods: A sleep staging classifier trained using deep learning methods scored PSG data from the Sleep Heart Health Study (SHHS). The training set was composed of 42 560 hours of PSG data from 5213 patients. To capture higher-order data, spectrograms were generated from electroencephalography, electrooculography, and electromyography data and then passed to the neural network. A holdout set of 580 PSGs not included in the training set was used to assess model accuracy and discrimination via weighted F1-score, per-stage accuracy, and Cohen's kappa (K). Results: The optimal neural network model was composed of spectrograms in the input layer feeding into convolutional neural network layers and a long short-term memory layer to achieve a weighted F1-score of 0.87 and K = 0.82. Conclusions: The deep learning sleep stage classifier demonstrates excellent accuracy and agreement with expert sleep stage scoring, outperforming human agreement on sleep staging. It achieves comparable or better F1-scores, accuracy, and Cohen's kappa compared to literature for automated sleep stage scoring of PSG epochs. Accurate automated scoring of other PSG events may eventually allow for fully automated PSG scoring.
引用
收藏
页数:10
相关论文
共 56 条
[1]   Computer-assisted sleep staging [J].
Agarwal, R ;
Gotman, J .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2001, 48 (12) :1412-1423
[2]   An E-health solution for automatic sleep classification according to Rechtschaffen and Kales:: Validation study of the Somnolyzer 24 x 7 utilizing the Siesta database [J].
Anderer, P ;
Gruber, G ;
Parapatics, S ;
Woertz, M ;
Miazhynskaia, T ;
Klösch, G ;
Saletu, B ;
Zeitlhofer, J ;
Barbanoj, MJ ;
Danker-Hopfe, H ;
Himanen, SL ;
Kemp, B ;
Penzel, T ;
Grözinger, M ;
Kunz, D ;
Rappelsberger, P ;
Schlögl, A ;
Dorffner, G .
NEUROPSYCHOBIOLOGY, 2005, 51 (03) :115-133
[3]   Computer-Assisted Sleep Classification according to the Standard of the American Academy of Sleep Medicine: Validation Study of the AASM Version of the Somnolyzer 24 x 7 [J].
Anderer, Peter ;
Moreau, Arnaud ;
Woertz, Michael ;
Ross, Marco ;
Gruber, Georg ;
Parapatics, Silvia ;
Loretz, Erna ;
Heller, Esther ;
Schmidt, Andrea ;
Boeck, Marion ;
Moser, Doris ;
Kloesch, Gerhard ;
Saletu, Bernd ;
Saletu-Zyhlarz, Gerda M. ;
Danker-Hopfe, Heidi ;
Zeitlhofer, Josef ;
Dorffner, Georg .
NEUROPSYCHOBIOLOGY, 2010, 62 (04) :250-264
[4]   Automatic classification of sleep stages based on the time-frequency image of EEG signals [J].
Bajaj, Varun ;
Pachori, Ram Bilas .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2013, 112 (03) :320-328
[5]   Automatic analysis of single-channel sleep EEG:: Validation in healthy individuals [J].
Berthomier, Christian ;
Drouot, Xavier ;
Herman-Stoieca, Maria ;
Berthomier, Pierre ;
Prado, Jacques ;
Bokar-Thire, Djibril ;
Benoit, Odile ;
Mattout, Jeremie ;
d'Ortho, Marie-Pia .
SLEEP, 2007, 30 (11) :1587-1595
[6]  
Biswal S., 2017, SLEEPNET: Automated Sleep Staging System via Deep Learning. arXiv, P1
[7]   MEASUREMENT ERROR IN VISUALLY SCORED ELECTROPHYSIOLOGICAL DATA - RESPIRATION DURING SLEEP [J].
BLIWISE, D ;
BLIWISE, NG ;
KRAEMER, HC ;
DEMENT, W .
JOURNAL OF NEUROSCIENCE METHODS, 1984, 12 (01) :49-56
[8]   The Role of Big Data in the Management of Sleep-Disordered Breathing [J].
Budhiraja, Rohit ;
Thomas, Robert ;
Kim, Matthew ;
Redline, Susan .
SLEEP MEDICINE CLINICS, 2016, 11 (02) :241-+
[9]   Convolutional Neural Networks for P300 Detection with Application to Brain-Computer Interfaces [J].
Cecotti, Hubert ;
Graeser, Axel .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (03) :433-445
[10]   A Deep Learning Architecture for Temporal Sleep Stage Classification Using Multivariate and Multimodal Time Series [J].
Chambon, Stanislas ;
Galtier, Mathieu N. ;
Arnal, Pierrick J. ;
Wainrib, Gilles ;
Gramfort, Alexandre .
IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2018, 26 (04) :758-769