Identification and diagnosis of cervical cancer using a hybrid feature selection approach with the bayesian optimization-based optimized catboost classification algorithm

被引:0
作者
Dhar J. [1 ]
Roy S. [2 ]
机构
[1] Computer Science and Engineering, Indian Institute of Technology Ropar, Punjab, Rupnagar
[2] Department of CSE, JLD Engineering and Management College, West Bengal, Baruipur
关键词
Bayesian optimization; Catboost algorithm; Cervical cancer risk prediction; Genetic algorithm; Hybrid feature selection;
D O I
10.1007/s12652-024-04825-8
中图分类号
学科分类号
摘要
Cervical cancer is the most prevailing woman illness globally. Since cervical cancer is a very preventable illness, early diagnosis exhibits the most adaptive plan to lessen its global responsibility. However, because of infrequent knowledge, shortage of access to pharmaceutical centers, and costly schemes worldwide, most probably in emerging nations, the vulnerable subject populations cannot regularly experience the test. So, we need a clinical screening analysis to diagnose cervical cancer early and support the doctor to heal and evade cervical cancer?s spread in women?s other organs and save several lives. Thus, this paper introduces a novel hybrid approach to solve such problems: a hybrid feature selection approach with the Bayesian optimization-based optimized CatBoost (HFS-OCB) method to diagnose and predict cervical cancer risk. Genetic algorithm and mutual information approaches utilize feature selection methodology in this suggested research and form a hybrid feature selection (HFS) method to generate the most significant features from the input dataset. This paper also utilizes a novel Bayesian optimization-based hyperparameter optimization approach: optimized CatBoost (OCB) method to provide the most optimal hyperparameters for the CatBoost algorithm. The CatBoost algorithm is used to classify cervical cancer risk. There are two real-world, publicly available cervical cancer-based datasets utilized in this suggested research to evaluate and verify the suggested approach?s performance. A 20-fold cross-validation strategy and a renowned performance evaluation metric are utilized to assess the suggested approach?s performance. The outcome implies that the possibility of forming cervical cancer can be efficiently foretold using the suggested HFS-OCB method. Therefore, the suggested approach?s indicated result is compared with the other algorithms and provides the prediction. Such a predicted result shows that the suggested approach is more capable, reliable, scalable, and more effective than the other machine learning algorithms. © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2024.
引用
收藏
页码:3459 / 3477
页数:18
相关论文
共 37 条
  • [1] Abdoh S.F., Abo Rizka M., Maghraby F.A., Cervical cancer diagnosis using random forest classifier with SMOTE and feature reduction techniques, IEEE Access, 6, pp. 59475-59485, (2018)
  • [2] Adem K., Kilicarslan S., Comert O., Classification and diagnosis of cervical cancer with stacked autoencoder and softmax classification, Expert Syst Appl, 115, pp. 557-564, (2019)
  • [3] Alfaries A., Mengash H., Yasar A., Shakshuki E., Advances in data science, cyber security and IT applications: first international conference on computing, ICC 2019, Riyadh, Saudi Arabia, December 10?12, 2019, proceedings, (2019)
  • [4] Alsmariy R., Healy G., Abdelhafez H., Predicting cervical cancer using machine learning methods, Int J Adv Comput Sci Appl, (2020)
  • [5] Alyafeai Z., Ghouti L., A fully-automated deep learning pipeline for cervical cancer classification, Expert Syst Appl, 141, (2020)
  • [6] Alzubaidi A., Cosma G., Brown D., Pockley A.G., A new hybrid global optimization approach for selecting clinical and biological features that are relevant to the effective diagnosis of ovarian cancer.?, 2016 IEEE Symposium Series on Computational Intelligence (SSCI).?, (2016)
  • [7] Amiri F., Rezaei Yousefi M., Lucas C., Shakery A., Yazdani N., Mutual information-based feature selection for intrusion detection systems, J Netw Comput Appl, 34, 4, pp. 1184-1199, (2011)
  • [8] Bhattacharjee A., Borgohain S.K., Soni B., Verma G., Gao X., Machine learning, image processing, network security and data sciences: second international conference, MIND 2020, Silchar, India, July 30?31, 2020, proceedings, part II, (2020)
  • [9] Billah S.M., Identifying Emerging Researchers Using Social Network Analysis, (2013)
  • [10] Cao A., He H., Chen Z., Zhang W., Performance evaluation of machine learning approaches for credit scoring, Int J Econ Finance Manage Sci, 6, 6, pp. 255-260, (2018)