Diabetes disease prediction using firefly optimization-based cat-boost classifier in big data analytics

被引:2
作者
Jenefer, G. Geo [1 ]
Deepa, A. J. [2 ]
机构
[1] St Xaviers Catholic Coll Engn, Dept Informat Technol, Chunkankadai, Tamil Nadu, India
[2] Ponjesly Coll Engn, Dept Comp Sci & Engn, Nagercoil, Tamil Nadu, India
关键词
CatBoost(CB); feature scaling; machine learning;
D O I
10.3233/JIFS-223105
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Globally, diabetes directly causes 1.5 million fatalities each year. It is necessary to predict such diseases at an earlier stage and cure them. Since modern healthcare data comprises huge amounts of information, it is tough to process such data in conventional databases. Previously, various machine learning (ML) algorithms were used to predict diabetics, and their performance was evaluated. But still, those existing algorithms result in poor accuracy and performance. This work proposes a FOCB (Firefly Optimization-based CatBoost) classifier for predicting diabetes. The PIMA Indian diabetic dataset has been taken as the input dataset. The proposed FOCB algorithm has been compared with various machine learning algorithms. From the results, we can see that the FOCB classifier gives the best accuracy of 96% with improved performance. The proposed system has been compared with other FO-based machine learning algorithms like NB, KNN, RF, AB, GB, XGB, CNN, DBN, and CB, and it has been proven that CB based on FO produces better accuracy with less hamming loss.
引用
收藏
页码:9943 / 9954
页数:12
相关论文
共 26 条
[1]  
[Anonymous], 2016, 2016 INT C COMPUTER
[2]   An Efficient Prediction Model for Diabetic Database Using Soft Computing Techniques [J].
Bhat, Veena H. ;
Rao, Prasanth G. ;
Shenoy, P. Deepa ;
Venugopal, K. R. ;
Patnaik, L. M. .
ROUGH SETS, FUZZY SETS, DATA MINING AND GRANULAR COMPUTING, PROCEEDINGS, 2009, 5908 :328-+
[3]   Disease Prediction by Machine Learning Over Big Data From Healthcare Communities [J].
Chen, Min ;
Hao, Yixue ;
Hwang, Kai ;
Wang, Lu ;
Wang, Lin .
IEEE ACCESS, 2017, 5 :8869-8879
[4]  
Dai Wei., 2014, International Journal of Database Theory and Application, V7, P49, DOI [DOI 10.14257/IJDTA.2014.7.1.05, 10.14257/ijdta.2014.7.1.05]
[5]  
Geo Jenefer G., 2018, J COMPUTATIONAL INFO, V14
[6]  
Gowsalya M, 2014, INT CONF ADV COMPU, P297, DOI 10.1109/ICoAC.2014.7229729
[7]  
Iyer A, 2015, Arxiv, DOI arXiv:1502.03774
[8]  
Jenefer G.G., 2021, REFINING PERFORMANCE
[9]  
Johari Nur Farahlina, 2013, Applied Mechanics and Materials, V421, P512, DOI 10.4028/www.scientific.net/AMM.421.512
[10]  
Kalyankar GD, 2017, 2017 INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC), P619, DOI 10.1109/I-SMAC.2017.8058253