Assessment of Parkinson's Disease Severity From Videos Using Deep Architectures

被引:14
作者
Yin, Zhao [1 ]
Geraedts, Victor J. [2 ,3 ]
Wang, Ziqi [1 ]
Contarino, Maria Fiorella [2 ,4 ]
Dibeklioglu, Hamdi [5 ]
van Gemert, Jan [1 ]
机构
[1] Delft Univ Technol, Fac Elect Engn Math & Comp Sci, NL-2600 Delft, Netherlands
[2] Leiden Univ, Med Ctr, Dept Neurol, Leiden, Netherlands
[3] Leiden Univ, Med Ctr, Dept Epidemiol, Leiden, Netherlands
[4] Haga Teaching Hosp, Dept Neurol, The Hague, Netherlands
[5] Bilkent Univ, Dept Comp Engn, Ankara, Turkey
关键词
Task analysis; Videos; Feature extraction; Three-dimensional displays; Transfer learning; Diseases; Training; Parkinson's disease (PD); severity classification; deep learning; transfer learning; self-attention; multi-domain learning; BRAIN-STIMULATION; MDS-UPDRS;
D O I
10.1109/JBHI.2021.3099816
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Parkinson's disease (PD) diagnosis is based on clinical criteria, i.e., bradykinesia, rest tremor, rigidity, etc. Assessment of the severity of PD symptoms with clinical rating scales, however, is subject to inter-rater variability. In this paper, we propose a deep learning based automatic PD diagnosis method using videos to assist the diagnosis in clinical practices. We deploy a 3D Convolutional Neural Network (CNN) as the baseline approach for the PD severity classification and show the effectiveness. Due to the lack of data in clinical field, we explore the possibility of transfer learning from non-medical dataset and show that PD severity classification can benefit from it. To bridge the domain discrepancy between medical and non-medical datasets, we let the network focus more on the subtle temporal visual cues, i.e., the frequency of tremors, by designing a Temporal Self-Attention (TSA) mechanism. Seven tasks from the Movement Disorders Society - Unified PD rating scale (MDS-UPDRS) part III are investigated, which reveal the symptoms of bradykinesia and postural tremors. Furthermore, we propose a multi-domain learning method to predict the patient-level PD severity through task-assembling. We show the effectiveness of TSA and task-assembling method on our PD video dataset empirically. We achieve the best MCC of 0.55 on binary task-level and 0.39 on three-class patient-level classification.
引用
收藏
页码:1164 / 1176
页数:13
相关论文
共 60 条
  • [1] Baccouche Moez, 2011, Human Behavior Unterstanding. Proceedings Second International Workshop, HBU 2011, P29, DOI 10.1007/978-3-642-25446-8_4
  • [2] Belalcazar-Bolanos E.A., 2015, Signal Processing, Images and Computer Vision (STSIVA), 2015 20th Symposium on, P1
  • [3] Attention Augmented Convolutional Networks
    Bello, Irwan
    Zoph, Barret
    Vaswani, Ashish
    Shlens, Jonathon
    Le, Quoc V.
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3285 - 3294
  • [4] Butt AH, 2017, INT C REHAB ROBOT, P116, DOI 10.1109/ICORR.2017.8009232
  • [5] Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
    Carreira, Joao
    Zisserman, Andrew
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4724 - 4733
  • [6] Cavallanti G, 2010, J MACH LEARN RES, V11, P2901
  • [7] Chorowski J, 2015, ADV NEUR IN, V28
  • [8] The MDS-UPDRS tracks motor and non-motor improvement due to subthalamic nucleus deep brain stimulation in Parkinson disease
    Chou, Kelvin L.
    Taylor, Jennifer L.
    Patil, Parag G.
    [J]. PARKINSONISM & RELATED DISORDERS, 2013, 19 (11) : 966 - 969
  • [9] Class-Balanced Loss Based on Effective Number of Samples
    Cui, Yin
    Jia, Menglin
    Lin, Tsung-Yi
    Song, Yang
    Belongie, Serge
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9260 - 9269
  • [10] Robust real-time periodic motion detection, analysis, and applications
    Cutler, R
    Davis, LS
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (08) : 781 - 796