Assessment of Parkinson's Disease Severity From Videos Using Deep Architectures

Cited by: 14

Authors:

Yin, Zhao ^{[1]}

Geraedts, Victor J. ^{[2,3]}

Wang, Ziqi ^{[1]}

Contarino, Maria Fiorella ^{[2,4]}

Dibeklioglu, Hamdi ^{[5]}

van Gemert, Jan ^{[1]}

Affiliations:

[1] Delft Univ Technol, Fac Elect Engn Math & Comp Sci, NL-2600 Delft, Netherlands

[2] Leiden Univ, Med Ctr, Dept Neurol, Leiden, Netherlands

[3] Leiden Univ, Med Ctr, Dept Epidemiol, Leiden, Netherlands

[4] Haga Teaching Hosp, Dept Neurol, The Hague, Netherlands

[5] Bilkent Univ, Dept Comp Engn, Ankara, Turkey

Source:

IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS | 2022 / Vol. 26 / Issue 03

Keywords:

Task analysis; Videos; Feature extraction; Three-dimensional displays; Transfer learning; Diseases; Training; Parkinson's disease (PD); severity classification; deep learning; transfer learning; self-attention; multi-domain learning; BRAIN-STIMULATION; MDS-UPDRS;

DOI:

10.1109/JBHI.2021.3099816

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Parkinson's disease (PD) diagnosis is based on clinical criteria such as bradykinesia, rest tremor, and rigidity. Assessment of the severity of PD symptoms with clinical rating scales, however, is subject to inter-rater variability. In this paper, we propose a deep-learning-based automatic PD diagnosis method using videos to assist diagnosis in clinical practice. We deploy a 3D Convolutional Neural Network (CNN) as the baseline approach for PD severity classification and show its effectiveness. Due to the lack of data in the clinical field, we explore the possibility of transfer learning from a non-medical dataset and show that PD severity classification can benefit from it. To bridge the domain discrepancy between medical and non-medical datasets, we let the network focus more on subtle temporal visual cues, i.e., the frequency of tremors, by designing a Temporal Self-Attention (TSA) mechanism. Seven tasks from the Movement Disorder Society-Unified Parkinson's Disease Rating Scale (MDS-UPDRS) Part III are investigated, which reveal the symptoms of bradykinesia and postural tremors. Furthermore, we propose a multi-domain learning method to predict the patient-level PD severity through task-assembling. We empirically show the effectiveness of TSA and the task-assembling method on our PD video dataset. We achieve a best Matthews correlation coefficient (MCC) of 0.55 on binary task-level classification and 0.39 on three-class patient-level classification.
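
The abstract reports results as MCC for both a binary and a three-class setting. As a reading aid, the following is a minimal sketch of the standard multi-class Matthews correlation coefficient, which reduces to the familiar binary form when there are two classes. The function name `mcc` and the label lists are hypothetical illustrations; they are not taken from the paper or its dataset.

```python
import math
from collections import Counter


def mcc(y_true, y_pred):
    """Matthews correlation coefficient for binary or multi-class labels.

    Illustrative helper (not from the paper). Returns 0.0 when the
    denominator is zero, e.g. when only one class is ever predicted.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    s = len(y_true)                                       # total samples
    c = sum(1 for t, p in zip(y_true, y_pred) if t == p)  # correct predictions
    t_counts = Counter(y_true)                            # true count per class
    p_counts = Counter(y_pred)                            # predicted count per class
    labels = set(t_counts) | set(p_counts)

    numerator = c * s - sum(t_counts[k] * p_counts[k] for k in labels)
    cov_tt = s * s - sum(v * v for v in t_counts.values())
    cov_pp = s * s - sum(v * v for v in p_counts.values())
    denominator = math.sqrt(cov_tt * cov_pp)
    return numerator / denominator if denominator else 0.0


# Made-up three-class labels (e.g. hypothetical severity levels 0, 1, 2).
example_true = [0, 1, 2, 0, 1, 2]
example_pred = [0, 1, 2, 0, 1, 1]
example_score = mcc(example_true, example_pred)
```

The score ranges from -1 (total disagreement) through 0 (no better than chance) to 1 (perfect agreement), and unlike plain accuracy it stays informative when the classes are imbalanced.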


Pages: 1164-1176

Page count: 13

