Improving prediction of cervical cancer using KNN imputer and multi-model ensemble learning

Cited by: 7
Authors
Aljrees, Turki [1]
Affiliations
[1] Univ Hafr Al Batin, Coll Comp Sci & Engn, Hafar al Batin, Saudi Arabia
Source
PLOS ONE | 2024 / Vol. 19 / Issue 01
Funding
UK Research and Innovation (UKRI);
Keywords
HUMAN-PAPILLOMAVIRUS; MACHINE; CLASSIFICATION; LEVEL;
DOI
10.1371/journal.pone.0295632
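Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code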
07 ; 0710 ; 09 ;
Abstract
Cervical cancer is a leading cause of women's mortality, emphasizing the need for early diagnosis and effective treatment. In line with the imperative of early intervention, the automated identification of cervical cancer has emerged as a promising avenue, leveraging machine learning techniques to enhance both the speed and accuracy of diagnosis. However, an inherent challenge in the development of these automated systems is the presence of missing values in the datasets commonly used for cervical cancer detection. Missing data can significantly impact the performance of machine learning models, potentially leading to inaccurate or unreliable results. This study addresses a critical challenge in automated cervical cancer identification: handling missing data in datasets. The study presents a novel approach that combines three machine learning models into a stacked ensemble voting classifier, complemented by the use of a KNN Imputer to manage missing values. The proposed model achieves an accuracy of 0.9941, precision of 0.98, recall of 0.96, and an F1 score of 0.97. The study examines three distinct scenarios: one involving the deletion of missing values, another utilizing KNN imputation, and a third employing PCA for imputing missing values. This research has significant implications for the medical field, offering medical experts a powerful tool for more accurate cervical cancer therapy and enhancing the overall effectiveness of testing procedures. By addressing missing data challenges and achieving high accuracy, this work represents a valuable contribution to cervical cancer detection, ultimately aiming to reduce the impact of this disease on women's health and healthcare systems.
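The two ideas named in the abstract, KNN imputation of missing values and voting across several models, can be illustrated with a minimal pure-Python sketch. This is not the paper's implementation: the abstract does not name the three base models, the neighbour count, or the voting rule, so the helper names (`nan_euclidean`, `knn_impute`, `hard_vote`), the choice of `k=2`, the simple majority vote, and the toy data below are all assumptions made for illustration.

```python
import math
from collections import Counter


def nan_euclidean(a, b):
    """Euclidean distance over features present in both rows,
    rescaled by the share of features that could be compared."""
    shared = [(x, y) for x, y in zip(a, b) if x is not None and y is not None]
    if not shared:
        return math.inf
    squared = sum((x - y) ** 2 for x, y in shared)
    return math.sqrt(len(a) / len(shared) * squared)


def knn_impute(rows, k=2):
    """Return a copy of rows where each None is replaced by the mean of
    that feature over the k nearest rows that have the feature observed."""
    filled = [list(row) for row in rows]
    for i, row in enumerate(rows):
        for j, value in enumerate(row):
            if value is not None:
                continue
            # Candidate donors: other rows with feature j observed.
            donors = sorted(
                (nan_euclidean(row, other), other[j])
                for m, other in enumerate(rows)
                if m != i and other[j] is not None
            )
            nearest = [d for d in donors if d[0] != math.inf][:k]
            if nearest:
                filled[i][j] = sum(v for _, v in nearest) / len(nearest)
    return filled


def hard_vote(predictions):
    """Majority label among the base models' predictions for one sample."""
    return Counter(predictions).most_common(1)[0][0]


# Toy data (hypothetical): the last feature of the first row is missing.
rows = [
    [1.0, 2.0, None],
    [1.0, 2.0, 3.0],
    [1.2, 2.1, 5.0],
    [8.0, 9.0, 20.0],
]
imputed = knn_impute(rows, k=2)

# Hypothetical labels from three base models for a single sample.
label = hard_vote([1, 0, 1])
```

In practice the same steps would more likely be built from library components such as scikit-learn's `KNNImputer` and `VotingClassifier`; the hand-written version above only makes the mechanics visible.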
Pages: 24