Feature selection and extraction for class prediction in dysphonia measures analysis: A case study on Parkinson's disease speech rehabilitation

被引:10
作者
El Moudden, Ismail [1 ]
Ouzir, Mounir [2 ]
ElBernoussi, Souad [1 ]
机构
[1] Mohammed V Univ Rabat, Fac Sci, Dept Math, Lab Math Comp Sci & Applicat, POB 1014, Rabat, Morocco
[2] Mohammed V Univ Rabat, Fac Sci, Dept Biol, Lab Biochem & Immunol, Rabat, Morocco
关键词
Dimension reduction; classification; machine learning; dysphonia features; Parkinson's disease; INTENSIVE VOICE TREATMENT; DIMENSION REDUCTION; TREATMENT LSVT(R); LARGE-SAMPLE; INDIVIDUALS; ALGORITHMS;
D O I
10.3233/THC-170824
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
BACKGROUND: Speech disorders such as dysphonia and dysarthria represent an early and common manifestation of Parkinson's disease. Class prediction is an essential task in automatic speech treatment, particularly in the Parkinson's disease case. Many classification experiments have been performed which focus on the automatic detection of Parkinson's disease patients from healthy speakers but results are still not optimistic. A major problem in accomplishing this task is high dimensionality of speech data. OBJECTIVE: In this work, the potential of Principal Component Analysis (PCA) based modeling in dimensionality reduction is taken into consideration as the data smoothening tool with multiclass target expression data. METHODS: On the basis of suggested PCA-based modeling, the power of class prediction using logistic regression (LR) and C5.0 in numeric data is investigated in publicly available Parkinson's disease dataset Silverman voice treatment (LSVT) to develop an advanced classification model. RESULTS: The main advantage of our model is the effective reduction of the number of factors from p = 309 to k = 32 for LSVT Voice Rehabilitation dataset, with a fine classification accuracy of 100% and 99.92% for PCA-LR and PCA-C5.0 respectively. In addition, using only 9 dysphonia features, classification accuracy was (99.20%) and (99.11%) for PCA-LR, and PCA-C5.0 respectively. CONCLUSIONS: Our combined dimension reduction and data smoothening approaches have significant potential to minimize the number of features and increase the classification accuracy and then automatically classify subjects into Parkinson's disease patients or healthy speakers.
引用
收藏
页码:693 / 708
页数:16
相关论文
共 61 条
  • [1] [Anonymous], PHIL T R SOC
  • [2] [Anonymous], THESIS
  • [3] [Anonymous], J TELEMED APPL
  • [4] [Anonymous], DATA MINING TOOLS SE
  • [5] [Anonymous], P 2009 2 INT WORKSH
  • [6] [Anonymous], SAMI 2015 IEEE 13 IN
  • [7] [Anonymous], NEUROCOMPUTING
  • [8] [Anonymous], THESIS
  • [9] [Anonymous], ELEMENTS STAT LEARNI
  • [10] [Anonymous], 2009, P 422 3 INT C SIGNAL