Diagnose Parkinson?s disease and cleft lip and palate using deep convolutional neural networks evolved by IP-based chimp optimization algorithm

被引:46
作者
Chen, Feng [1 ,2 ]
Yang, Chunyan [1 ]
Khishe, Mohammad [3 ]
机构
[1] Wenzhou Med Univ, Fac Informat, Wenzhou 325035, Zhejiang, Peoples R China
[2] Ctr China Normal Univ, Fac Artificial Intelligence Educ, Wuhan 430079, Hubei, Peoples R China
[3] Imam Khomeini Marine Sci Univ, Dept Elect Engn, Nowshahr, Iran
关键词
Pathological speech; Deep convolutional neural networks; Parkinson?s disease; Cleft lip and palate; Chimp optimization algorithm; SPEECH ENHANCEMENT; CLASSIFICATION; VOICE;
D O I
10.1016/j.bspc.2022.103688
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Speech signals often include paralinguistic features such as pathologies that impair a speaker's capability to communicate. Those cognitive symptoms have various causes depending on the disease. For example, morphological diseases like cleft lip and palate create hypernasality, while neurodegenerative conditions like Parkinson's disease cause hypokinetic dysarthria. Automatic assessment of abnormal speech supports early diagnosis or disease severity evaluation. Conventional methods rely on manually assessing single aspects like shimmer, jitter, or formant frequencies, which may not fully reflect the disease's manifestations. In this paper, we use deep convolutional neural networks (DCNNs) to recognize disordered speech. Despite DCNNs' many approved benefits, selecting the best structure is challenging. In order to overcome this issue, this research looks into using the chimp optimization algorithm (ChOA) to automatically select the optimal DCNN structure. In order to achieve the goal, three ChOA-based advancements are proposed. First, an internet protocol address-based (IPA-based) encoding method for DCNN layers employing chimp vectors is created. Then an Enfeebled layer with specified chimp vector dimensions is presented for variable-length DCNNs. Eventually, large datasets are partitioned into smaller ones and evaluated at random to recognize abnormal speech signals from patients with Parkinson's disease and cleft lip and palate. In addition to receiver operating characteristic (ROC) and precision-recall curves, five well-known metrics were used: sensitivity, specificity, accuracy, precision, F1-Score. The proposed model accurately diagnoses disordered and normal speech signals, with an accuracy of up to 96.37%, which is 1.62 more accurate than the second-best approach, i.e., VLNSGA-II.
引用
收藏
页数:14
相关论文
共 61 条
[1]  
Abayomi-Alli Olusola O., 2020, 2020 15th Conference on Computer Science and Information Systems (FedCSIS), P371, DOI 10.15439/2020F188
[2]  
Abbas SA, 2018, DIYALA J PURE SCI, V14, P220
[3]  
Akuzawa K., 2018, In INTERSPEECH
[4]  
Alhinti L., 2020, International Journal of Psychological and Behavioral Sciences, V14, P187
[5]   Adaptive CCR-ELM with variable-length brain storm optimization algorithm for class-imbalance learning [J].
Cheng, Jian ;
Chen, Jingjing ;
Guo, Yi-nan ;
Cheng, Shi ;
Yang, Linkai ;
Zhang, Pei .
NATURAL COMPUTING, 2021, 20 (01) :11-22
[6]   A practical tutorial on the use of nonparametric statistical tests as a methodology for comparing evolutionary and swarm intelligence algorithms [J].
Derrac, Joaquin ;
Garcia, Salvador ;
Molina, Daniel ;
Herrera, Francisco .
SWARM AND EVOLUTIONARY COMPUTATION, 2011, 1 (01) :3-18
[7]   Characterization and classification of Parkinson's disease patients based on symbolic dynamics analysis of heart rate variability [J].
Dorantes-Mendez, Guadalupe ;
Mendez, Martin O. ;
Mendez-Magdaleno, Laura E. ;
Munoz-Mata, Brenda G. ;
Rodriguez-Leyva, Ildefonso ;
Mejia-Rodriguez, Aldo R. .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 71
[8]  
ER M.B., 2021, PARKINSONS DETECTION
[9]   Detection of Pathological Voice Using Cepstrum Vectors: A Deep Learning Approach [J].
Fang, Shih-Hau ;
Tsao, Yu ;
Hsiao, Min-Jing ;
Chen, Ji-Ying ;
Lai, Ying-Hui ;
Lin, Feng-Chuan ;
Wang, Chi-Te .
JOURNAL OF VOICE, 2019, 33 (05) :634-641
[10]  
Fu S.-W., 2017, Workshop of MLSP, P1