共 50 条
SViT: A Spectral Vision Transformer for the Detection of REM Sleep Behavior Disorder
被引:7
|作者:
Gunter, Katarina Mary
[1
,2
]
Brink-Kjaer, Andreas
[3
]
Mignot, Emmanuel
[4
]
Sorensen, Helge B. D.
[3
]
During, Emmanuel
[5
]
Jennum, Poul
[6
]
机构:
[1] Tech Univ Denmark, DK-2800 Kongens Lyngby, Denmark
[2] John Radcliffe Hosp, Nuffield Dept Clin Neurosci, Oxford OX3 9DU, England
[3] Tech Univ Denmark, Dept Hlth Technol, DK-2800 Kongens Lyngby, Denmark
[4] Stanford Univ, Ctr Sleep Sci & Med, Palo Alto, CA 94304 USA
[5] Stanford Univ, Dept Psychiat & Behav Sci, Palo Alto, CA 94304 USA
[6] Rigshospitalet, Danish Ctr Sleep Med, Dept Clin Neurophysiol, DK-2100 Copenhagen, Denmark
关键词:
Computer vision;
deep learning;
Parkinson's disease;
polysomnography;
RBD;
vision transformer;
PARKINSONIAN DISORDER;
DELAYED EMERGENCE;
DEMENTIA;
EEG;
D O I:
10.1109/JBHI.2023.3292231
中图分类号:
TP [自动化技术、计算机技术];
学科分类号:
0812 ;
摘要:
REM sleep behavior disorder (RBD) is a parasomnia with dream enactment and presence of REM sleep without atonia (RSWA). RBD diagnosed manually via polysomnography (PSG) scoring, which is time intensive. Isolated RBD (iRBD) is also associated with a high probability of conversion to Parkinson's disease. Diagnosis of iRBD is largely based on clinical evaluation and subjective PSG ratings of REM sleep without atonia. Here we show the first application of a novel spectral vision transformer (SViT) to PSG signals for detection of RBD and compare the results to the more conventional convolutional neural network architecture. The vision-based deep learning models were applied to scalograms (30 or 300 s windows) of the PSG data (EEG, EMG and EOG) and the predictions interpreted. A total of 153 RBD (96 iRBD and 57 RBD with PD) and 190 controls were included in the study and 5-fold bagged ensemble was used. Model outputs were analyzed per-patient (averaged), with regards to sleep stage, and the SViT was interpreted using integrated gradients. Models had a similar per-epoch test F1 score. However, the vision transformer had the best per-patient performance, with an F1 score 0.87. Training the SViT on channel subsets, it achieved an F1 score of 0.93 on a combination of EEG and EOG. EMG is thought to have the highest diagnostic yield, but interpretation of our model showed that high relevance was placed on EEG and EOG, indicating these channels could be included for diagnosing RBD.
引用
收藏
页码:4285 / 4292
页数:8
相关论文