Voice based classification of patients with Amyotrophic Lateral Sclerosis, Parkinson's Disease and Healthy Controls with CNN-LSTM using transfer learning

被引:0
作者
Mallela, Jhansi [1 ]
Illa, Aravind [1 ]
Suhas, B. N. [1 ]
Udupa, Sathvik [1 ]
Belur, Yamini [3 ]
Atchayaram, Nalini [4 ]
Yadav, Ravi [4 ]
Reddy, Pradeep [4 ]
Gope, Dipanjan [2 ]
Ghosh, Prasanta Kumar [1 ]
机构
[1] Indian Inst Sci, EE Dept, Bengaluru 560012, India
[2] Indian Inst Sci, ECE Dept, Bengaluru 560012, India
[3] Natl Inst Mental Hlth & Neurosci, Dept Speech Pathol & Audiol, Bengaluru 560029, India
[4] Natl Inst Mental Hlth & Neurosci, Dept Neurol, Bengaluru 560029, India
来源
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2020年
关键词
Amyotrophic Lateral Sclerosis; Parkinson's Disease; CNN-LSTM; DIAGNOSIS; PATTERN; SPEECH; ALS;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we consider 2-class and 3-class classification problems for classifying patients with Amyotrophic Lateral Sclerosis (ALS), Parkinson's Disease (PD), and Healthy Controls (HC) using a CNN-LSTM network. Classification performance is examined for three different tasks, namely, Spontaneous speech (SPON), Diadochokinetic rate (DIDK) and Sustained phoneme production (PHON). Experiments are conducted using speech data recorded from 60 ALS, 60 PD, and 60 HC subjects. Classifications using SVM and DNN are considered as baseline schemes. Classification accuracy of ALS and HC (indicated by ALS/HC) using CNN-LSTM has shown an improvement of 10.40%, 4.22% and 0.08% for PHON, SPON and DIDK tasks, respectively over the best of the baseline schemes. Furthermore, the CNN-LSTM network achieves the highest PD/HC classification accuracy of 88.5% for the SPON task and the highest 3-class (ALS/PD/HC) classification accuracy of 85.24% for the DIDK task. Experiments using transfer learning at low resource training data show that data from ALS benefits PD/HC classification and vice-versa. Experiments with fine-tuning weights of 3-class (ALS/PD/HC) classifier for 2-class classification (PD/HC or ALS/HC) gives an absolute improvement of 2% classification accuracy in SPON task when compared with randomly initialized 2-class classifier.
引用
收藏
页码:6784 / 6788
页数:5
相关论文
共 24 条
[11]   PATTERNS OF DYSARTHRIC MOVEMENT IN PATIENTS WITH AMYOTROPHIC LATERAL SCLEROSIS AND PSEUDO-BULBAR PALSY [J].
HIROSE, H ;
KIRITANI, S ;
SAWASHIMA, M .
FOLIA PHONIATRICA, 1982, 34 (02) :106-112
[12]  
Hirose H, 1982, ANN B RES I LOGOPEDI, V16, P229
[13]  
Hochreiter S., 1997, Neural Computation, V9, P1735
[14]  
Illa A, 2018, 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), P6014, DOI 10.1109/ICASSP.2018.8461836
[15]  
Kent R. D., 1996, American Journal of Speech-Language Pathology, V5, P7, DOI DOI 10.1044/1058-0360.0503.07
[16]   SPEECH DETERIORATION IN AMYOTROPHIC-LATERAL-SCLEROSIS - A CASE-STUDY [J].
KENT, RD ;
SUFIT, RL ;
ROSENBEK, JC ;
KENT, JF ;
WEISMER, G ;
MARTIN, RE ;
BROOKS, BR .
JOURNAL OF SPEECH AND HEARING RESEARCH, 1991, 34 (06) :1269-1275
[17]   Amyotrophic lateral sclerosis [J].
Kiernan, Matthew C. ;
Vucic, Steve ;
Cheah, Benjamin C. ;
Turner, Martin R. ;
Eisen, Andrew ;
Hardiman, Orla ;
Burrell, James R. ;
Zoing, Margaret C. .
LANCET, 2011, 377 (9769) :942-955
[18]   Understanding deep convolutional networks [J].
Mallat, Stephane .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2016, 374 (2065)
[19]   Speaking rate effects on articulatory pattern consistency in talkers with mild ALS [J].
Mefferd, Antje S. ;
Pattee, Gary L. ;
Green, Jordan R. .
CLINICAL LINGUISTICS & PHONETICS, 2014, 28 (11) :799-811
[20]  
Mishra VK, 2016, PROCEEDINGS OF THE 2016 IEEE 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRICAL & ELECTRONICS, INFORMATION, COMMUNICATION & BIO INFORMATICS (IEEE AEEICB-2016), P455, DOI 10.1109/AEEICB.2016.7538330