CNN-Based Identification of Parkinson's Disease from Continuous Speech in Noisy Environments

被引：14

作者：

Farago, Paul ^{[1
]}

Stefaniga, Sebastian-Aurelian ^{[2
]}

Cordos, Claudia-Georgiana ^{[1
]}

Mihaila, Laura-Ioana ^{[1
]}

Hintea, Sorin ^{[1
]}

Pestean, Ana-Sorina ^{[3
]}

Beyer, Michel ^{[4
,5
]}

Perju-Dumbrava, Lacramioara ^{[3
]}

Ilesan, Robert Radu ^{[3
,4
]}

机构：

[1] Tech Univ Cluj Napoca, Fac Elect Telecommun & Informat Technol, Bases Elect Dept, Cluj Napoca 400114, Romania

[2] West Univ Timisoara, Fac Math & Comp Sci, Dept Comp Sci, Timisoara 300223, Romania

[3] Univ Med Pharm Iuliu Hatieganu Cluj Napoca, Fac Med, Dept Neurol & Pediat Neurol, Cluj Napoca 400012, Romania

[4] Univ Hosp Basel, Clin Oral & Cranio Maxillofacial Surg, CH-4031 Basel, Switzerland

[5] Univ Basel, Dept Biomed Engn, Med Addit Mfg Res Grp Swiss MAM, CH-4123 Allschwil, Switzerland

来源：

BIOENGINEERING-BASEL | 2023年 / 10卷 / 05期

关键词：

speech assessment; hypokinetic dysarthria; artificial intelligence; Parkinson's disease; continuous speech; noisy speech; pre-diagnosis; convolutional neural networks; spectrograms; Wiener filter;

D O I：

10.3390/bioengineering10050531

中图分类号：

Q81 [生物工程学（生物技术）]; Q93 [微生物学];

学科分类号：

071005 ; 0836 ; 090102 ; 100705 ;

摘要：

Parkinson's disease is a progressive neurodegenerative disorder caused by dopaminergic neuron degeneration. Parkinsonian speech impairment is one of the earliest presentations of the disease and, along with tremor, is suitable for pre-diagnosis. It is defined by hypokinetic dysarthria and accounts for respiratory, phonatory, articulatory, and prosodic manifestations. The topic of this article targets artificial-intelligence-based identification of Parkinson's disease from continuous speech recorded in a noisy environment. The novelty of this work is twofold. First, the proposed assessment workflow performed speech analysis on samples of continuous speech. Second, we analyzed and quantified Wiener filter applicability for speech denoising in the context of Parkinsonian speech identification. We argue that the Parkinsonian features of loudness, intonation, phonation, prosody, and articulation are contained in the speech, speech energy, and Mel spectrograms. Thus, the proposed workflow follows a feature-based speech assessment to determine the feature variation ranges, followed by speech classification using convolutional neural networks. We report the best classification accuracies of 96% on speech energy, 93% on speech, and 92% on Mel spectrograms. We conclude that the Wiener filter improves both feature-based analysis and convolutional-neural-network-based classification performances.

引用

页数：37

共 68 条

[1] A new diagnostic approach for the identification of patients with neurodegenerative cognitive complaints [J].