Novel Enhanced-Grey Wolf Optimization hybrid machine learning technique for biomedical data computation

被引:62
作者
Chakraborty, Chinmay [1 ,2 ]
Kishor, Amit [3 ]
Rodrigues, Joel J. P. C. [4 ,5 ]
机构
[1] BIT, Dept Elect & Commun Engn, Mesra, Jharkhand, India
[2] Univ Fed Piaui, Teresina, PI, Brazil
[3] Swami Vivekanand Subharti Univ, Dept Comp Sci & Engn, Meerut, Uttar Pradesh, India
[4] Senac Fac Ceara, Res Dev & Innovat, BR-60160194 Fortaleza, Ceara, Brazil
[5] Inst Telecomunicacoes, Covilha Delegat, P-6201001 Covilha, Portugal
关键词
Machine learning; Biomedical data; Classification; Feature selection; Bagging technique; Boosting technique; Grey wolf optimization;
D O I
10.1016/j.compeleceng.2022.107778
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Today, a significant number of biomedical data is generated continuously from various biomedical equipment and experiments due to rapid technological improvements in medical sciences. Effective biomedical data analysis, such as extracting biological and diagnostically significant features, is a very challenging task. This paper proposes hybrid Machine Learning Classification Techniques based on ensemble technique with Enhanced-Grey Wolf Optimization (E-GWO) feature selection algorithm to analyze these complex biomedical data. We combined five biomedical heart disease data sets for the experimental work: Cleveland, Long-Beach-VA, Switzerland, Hungarians, and Statlog. New hybrid Machine Learning Classification Techniques classifiers like Naive Bayes Bagging Technique (NBBT), Random Forest Bagging Technique (RFBT), Decision Tree Bagging Technique (DTBT), K-Nearest Neighbors Bagging Technique (KNNBT), Neural Network Bagging Technique (NNBT), Gradient Boosting Boosting Technique (GBBT), and Adaptive Boosting Boosting Technique (ABBT) are developed with bagging and boosting methods. Accuracy, Recall, Precision, F1-score, Specificity, Error Rate, G-mean, False Negative Rate (FNR), False Positive Rate (FPR), and Negative Predictive Value (NPV) are used to evaluate hybrid techniques. The experimental result shows that the developed hybrid classifier RFBT achieves the highest accuracy of 99.26% with E-GWO feature selection. The proposed method has 11.90% improved the accuracy to the conventional model.
引用
收藏
页数:15
相关论文
共 25 条
[1]  
Abdeldjouad Fatma Zahra, 2020, Impact of Digital Technologies on Public Health in Developed and Developing Countries. 18th International Conference, ICOST 2020. Proceedings. Lecture Notes in Computer Science (LNCS 12157), P299, DOI 10.1007/978-3-030-51517-1_26
[2]  
Alizadehsani Roohallah, 2013, Res Cardiovasc Med, V2, P133, DOI 10.5812/cardiovascmed.10888
[3]   A new complement naive Bayesian approach for biomedical data classification [J].
Anagaw, Amare ;
Chang, Yang-Lang .
JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2019, 10 (10) :3889-3897
[4]  
[Anonymous], **DATA OBJECT**
[5]   Research on Travel Time Prediction Model of Freeway Based on Gradient Boosting Decision Tree [J].
Cheng, Juan ;
Li, Gen ;
Chen, Xianhua .
IEEE ACCESS, 2019, 7 :7466-7480
[6]   Machine learning can predict survival of patients with heart failure from serum creatinine and ejection fraction alone [J].
Chicco, Davide ;
Jurman, Giuseppe .
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2020, 20 (01)
[7]   A fine-grained Random Forests using class decomposition: an application to medical diagnosis [J].
Elyan, Eyad ;
Gaber, Mohamed Medhat .
NEURAL COMPUTING & APPLICATIONS, 2016, 27 (08) :2279-2288
[8]   Sample size estimation in diagnostic test studies of biomedical informatics [J].
Hajian-Tilaki, Karimollah .
JOURNAL OF BIOMEDICAL INFORMATICS, 2014, 48 :193-204
[9]  
Kannan R, 2019, SPRINGERBR APPL SCI, P63, DOI 10.1007/978-981-13-0059-2_8
[10]   A Comprehensive Survey for Intelligent Spam Email Detection [J].
Karim, Asif ;
Azam, Sami ;
Shanmugam, Bharanidharan ;
Kannoorpatti, Krishnan ;
Alazab, Mamoun .
IEEE ACCESS, 2019, 7 :168261-168295