CNN-Based Identification of Parkinson's Disease from Continuous Speech in Noisy Environments

被引:14
作者
Farago, Paul [1 ]
Stefaniga, Sebastian-Aurelian [2 ]
Cordos, Claudia-Georgiana [1 ]
Mihaila, Laura-Ioana [1 ]
Hintea, Sorin [1 ]
Pestean, Ana-Sorina [3 ]
Beyer, Michel [4 ,5 ]
Perju-Dumbrava, Lacramioara [3 ]
Ilesan, Robert Radu [3 ,4 ]
机构
[1] Tech Univ Cluj Napoca, Fac Elect Telecommun & Informat Technol, Bases Elect Dept, Cluj Napoca 400114, Romania
[2] West Univ Timisoara, Fac Math & Comp Sci, Dept Comp Sci, Timisoara 300223, Romania
[3] Univ Med Pharm Iuliu Hatieganu Cluj Napoca, Fac Med, Dept Neurol & Pediat Neurol, Cluj Napoca 400012, Romania
[4] Univ Hosp Basel, Clin Oral & Cranio Maxillofacial Surg, CH-4031 Basel, Switzerland
[5] Univ Basel, Dept Biomed Engn, Med Addit Mfg Res Grp Swiss MAM, CH-4123 Allschwil, Switzerland
来源
BIOENGINEERING-BASEL | 2023年 / 10卷 / 05期
关键词
speech assessment; hypokinetic dysarthria; artificial intelligence; Parkinson's disease; continuous speech; noisy speech; pre-diagnosis; convolutional neural networks; spectrograms; Wiener filter;
D O I
10.3390/bioengineering10050531
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Parkinson's disease is a progressive neurodegenerative disorder caused by dopaminergic neuron degeneration. Parkinsonian speech impairment is one of the earliest presentations of the disease and, along with tremor, is suitable for pre-diagnosis. It is defined by hypokinetic dysarthria and accounts for respiratory, phonatory, articulatory, and prosodic manifestations. The topic of this article targets artificial-intelligence-based identification of Parkinson's disease from continuous speech recorded in a noisy environment. The novelty of this work is twofold. First, the proposed assessment workflow performed speech analysis on samples of continuous speech. Second, we analyzed and quantified Wiener filter applicability for speech denoising in the context of Parkinsonian speech identification. We argue that the Parkinsonian features of loudness, intonation, phonation, prosody, and articulation are contained in the speech, speech energy, and Mel spectrograms. Thus, the proposed workflow follows a feature-based speech assessment to determine the feature variation ranges, followed by speech classification using convolutional neural networks. We report the best classification accuracies of 96% on speech energy, 93% on speech, and 92% on Mel spectrograms. We conclude that the Wiener filter improves both feature-based analysis and convolutional-neural-network-based classification performances.
引用
收藏
页数:37
相关论文
共 68 条
[1]   A new diagnostic approach for the identification of patients with neurodegenerative cognitive complaints [J].
Al-Hameed, Sabah ;
Benaissa, Mohammed ;
Christensen, Heidi ;
Mirheidari, Bahman ;
Blackburn, Daniel ;
Reuber, Markus .
PLOS ONE, 2019, 14 (05)
[2]   A Review of Physical and Perceptual Feature Extraction Techniques for Speech, Music and Environmental Sounds [J].
Alias, Francesc ;
Socoro, Joan Claudi ;
Sevillano, Xavier .
APPLIED SCIENCES-BASEL, 2016, 6 (05)
[3]   An algorithm for Parkinson's disease speech classification based on isolated words analysis [J].
Amato, Federica ;
Borzi, Luigi ;
Olmo, Gabriella ;
Orozco-Arroyave, Juan Rafael .
HEALTH INFORMATION SCIENCE AND SYSTEMS, 2021, 9 (01)
[4]  
Appakaya SB, 2020, IEEE ENG MED BIO, P236, DOI 10.1109/EMBC44109.2020.9176193
[5]   Connected Speech in Neurodegenerative Language Disorders: A Review [J].
Boschi, Veronica ;
Catricala, Eleonora ;
Consonni, Monica ;
Chesi, Cristiano ;
Moro, Andrea ;
Cappa, Stefano F. .
FRONTIERS IN PSYCHOLOGY, 2017, 8 :1-21
[6]   Speech disorders in Parkinson's disease: early diagnostics and effects of medication and brain stimulation [J].
Brabenec, L. ;
Mekyska, J. ;
Galaz, Z. ;
Rektorova, Irena .
JOURNAL OF NEURAL TRANSMISSION, 2017, 124 (03) :303-334
[7]   Non-Motor symptoms in Portuguese Parkinson's Disease patients: correlation and impact on Quality of Life and Activities of Daily Living [J].
Bugalho, Paulo ;
Lampreia, Tania ;
Miguel, Rita ;
Mendonca, Marcelo D. ;
Caetano, Andre ;
Barbosa, Raquel .
SCIENTIFIC REPORTS, 2016, 6
[8]   Application of the Lognormal Model to the Vocal Tract Movement to Detect Neurological Diseases in Voice [J].
Carmona-Duarte, Cristina ;
Plamondon, Rejean ;
Gomez-Vilda, Pedro ;
Ferrer, Miguel A. ;
Alonso, Jesus B. ;
Londral, Ana Rita M. .
INNOVATION IN MEDICINE AND HEALTHCARE 2016, 2016, 60 :25-35
[9]   Xception: Deep Learning with Depthwise Separable Convolutions [J].
Chollet, Francois .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1800-1807
[10]   ECG signal classification using Convolutional Neural Networks for Biometric Identification [J].
Cordos, Claudia ;
Mihaila, Laura ;
Farago, Paul ;
Hintea, Sorin .
2021 44TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2021, :167-170