A novel sample and feature dependent ensemble approach for Parkinson's disease detection

被引:26
作者
Ali, Liaqat [1 ]
Chakraborty, Chinmay [2 ]
He, Zhiquan [3 ]
Cao, Wenming [3 ]
Imrana, Yakubu [4 ]
Rodrigues, Joel J. P. C. [5 ,6 ]
机构
[1] Univ Sci & Technol, Dept Elect Engn, Bannu, Pakistan
[2] Birla Inst Technol, Dept Elect & Commun Engn, Jharkhand, India
[3] Shenzhen Univ, Guangdong Multimedia Informat Serv Engn Technol R, Shenzhen 518000, Peoples R China
[4] Univ Elect Sci & Technol China UESTC, Sch Comp Sci & Engn, Chengdu, Peoples R China
[5] China Univ Petr East China, Coll Comp Sci & Technol, Qingdao 266555, Peoples R China
[6] Inst Telecomunicacoes, Covilha Delegat, Covilha, Portugal
基金
中国国家自然科学基金;
关键词
Ensemble learning; F-score based feature selection; hyperparameters optimization; deep neural networks; Parkinson's disease; VOICE RECORDING REPLICATIONS; HYBRID INTELLIGENT SYSTEM; MULTIPLE TYPES; FEATURE-SELECTION; EARLY-DIAGNOSIS; CLASSIFICATION; PREDICTION; CANCER;
D O I
10.1007/s00521-022-07046-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Parkinson's disease (PD) is a neurological disease that has been reported to have affected most people worldwide. Recent research pointed out that about 90% of PD patients possess voice disorders. Motivated by this fact, many researchers proposed methods based on multiple types of speech data for PD prediction. However, these methods either face the problem of low rate of accuracy or lack generalization. To develop an approach that will be free of these issues, in this paper we propose a novel ensemble approach. These paper contributions are two folds. First, investigating feature selection integration with deep neural network (DNN) and validating its effectiveness by comparing its performance with conventional DNN and other similar integrated systems. Second, development of a novel ensemble model namely EOFSC (Ensemble model with Optimal Features and Sample Dependant Base Classifiers) that exploits the findings of recently published studies. Recent research pointed out that for different types of voice data, different optimal models are obtained which are sensitive to different types of samples and subsets of features. In this paper, we further consolidate the findings by utilizing the proposed integrated system and propose the development of EOFSC. For multiple types of vowel phonations, multiple base classifiers are obtained which are sensitive to different subsets of features. These features and sample-dependent base classifiers are integrated, and the proposed EOFSC model is constructed. To evaluate the final prediction of the EOFSC model, the majority voting methodology is adopted. Experimental results point out that feature selection integration with neural networks improves the performance of conventional neural networks. Additionally, feature selection integration with DNN outperforms feature selection integration with conventional machine learning models. Finally, the newly developed ensemble model is observed to improve PD detection accuracy by 6.5%.
引用
收藏
页码:15997 / 16010
页数:14
相关论文
共 59 条
[1]   Classification and diagnosis of cervical cancer with softmax classification with stacked autoencoder [J].
Adem, Kemal ;
Kilicarslan, Serhat ;
Comert, Onur .
EXPERT SYSTEMS WITH APPLICATIONS, 2019, 115 :557-564
[2]   A recurrence plot-based approach for Parkinson's disease identification [J].
Afonso, Luis C. S. ;
Rosa, Gustavo H. ;
Pereira, Clayton R. ;
Weber, Silke A. T. ;
Hook, Christian ;
Albuquerque, Victor Hugo C. ;
Papa, Joao P. .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 94 :282-292
[3]   A hybrid machine learning framework to predict mortality in paralytic ileus patients using electronic health records (EHRs) [J].
Ahmad, Fahad Shabbir ;
Ali, Liaqat ;
Raza-Ul-Mustafa ;
Khattak, Hasan Ali ;
Hameed, Tahir ;
Wajahat, Iram ;
Kadry, Seifedine ;
Bukhari, Syed Ahmad Chan .
JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 12 (03) :3283-3293
[4]   A statistically rigorous deep neural network approach to predict mortality in trauma patients admitted to the intensive care unit [J].
Ahmed, Fahad Shabbir ;
Ali, Liaqat ;
Joseph, Bellal A. ;
Ikram, Asad ;
Ul Mustafa, Raza ;
Bukhari, Syed Ahmad Chan .
JOURNAL OF TRAUMA AND ACUTE CARE SURGERY, 2020, 89 (04) :736-742
[5]   Support vector machines combined with feature selection for breast cancer diagnosis [J].
Akay, Mehmet Fatih .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) :3240-3247
[6]   Development of Hepatitis Disease Detection System by Exploiting Sparsity in Linear Support Vector Machine to Improve Strength of AdaBoost Ensemble Model [J].
Akbar, Wasif ;
Wu, Wei-ping ;
Saleem, Sehrish ;
Farhan, Muhammad ;
Saleem, Muhammad Asim ;
Javeed, Ashir ;
Ali, Liaqat .
MOBILE INFORMATION SYSTEMS, 2020, 2020
[7]  
Al-Fatlawi AH, 2016, IEEE C EVOL COMPUTAT, P1324, DOI 10.1109/CEC.2016.7743941
[8]   An Approach Based on Mutually Informed Neural Networks to Optimize the Generalization Capabilities of Decision Support Systems Developed for Heart Failure Prediction [J].
Ali, L. ;
Bukhari, S. A. C. .
IRBM, 2021, 42 (05) :345-352
[9]   LDA-GA-SVM: improved hepatocellular carcinoma prediction through dimensionality reduction and genetically optimized support vector machine [J].
Ali, Liaqat ;
Wajahat, Iram ;
Golilarz, Noorbakhsh Amiri ;
Keshtkar, Fazel ;
Bukhari, Syed Ahmad Chan .
NEURAL COMPUTING & APPLICATIONS, 2021, 33 (07) :2783-2792
[10]   A Feature-Driven Decision Support System for Heart Failure Prediction Based on χ2 Statistical Model and Gaussian Naive Bayes [J].
Ali, Liaqat ;
Khan, Shafqat Ullah ;
Golilarz, Noorbakhsh Amiri ;
Yakubu, Imrana ;
Qasim, Iqbal ;
Noor, Adeeb ;
Nour, Redhwan .
COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2019, 2019