Enhancing Cervical Cancer Prediction: A Comparative Analysis of Machine Learning Algorithms and Development of a Novel Screening Tool

被引:0
作者
Edafetanure-Ibeh, Faith Tobore [1 ]
机构
[1] Harrisburg Univ Sci & Technol, Data Sci PhD, Harrisburg, PA 17101 USA
来源
2024 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE, IRI 2024 | 2024年
关键词
Cervical Cancer; Machine Learning; Predictive Modeling; XGBoost; Classification; Healthcare Analytics;
D O I
10.1109/IRI62200.2024.00055
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The early discovery of cervical cancer is crucial for efficient treatment and increased survival rates, making it a severe public health concern [1]. This study uses a consistent dataset to compare various machine-learning methods for cervical cancer prediction. We utilized a variety of machine learning techniques, including Random Forest, Naive Bayes, Support Vector Machine (SVM) with a linear kernel, K-Nearest Neighbors (KNN), Logistic Regression, and Extreme Gradient Boosting (XGBoost), to identify and forecast the risk of cervical cancer. Based on the accuracy, precision, recall, F1-score, and confusion matrices, the effectiveness of these algorithms was assessed [2]. The most appropriate model for this application is XGBoost, which fared better than other models in recall and F1-score, even if more conventional methods, such as Random Forest and KNN, showed excellent overall accuracy. The study results imply that XGBoost has excellent potential for creating an efficient cervical cancer screening tool due to its balance of sensitivity and precision. The model is then integrated into a web-based application and an interactive chatbot designed to facilitate early detection and assessment of cervical cancer risks.
引用
收藏
页码:228 / 233
页数:6
相关论文
共 20 条
[1]   Preventing Cervical Cancer Globally: Are We Making Progress? [J].
Allanson, Emma R. ;
Schmeler, Kathleen M. .
CANCER PREVENTION RESEARCH, 2021, 14 (12) :1055-1060
[2]  
[Anonymous], 2019, Human papillomavirus (HPV) and cervical cancer
[3]   Estimates of incidence and mortality of cervical cancer in 2018: a worldwide analysis [J].
Arbyn, Marc ;
Weiderpass, Elisabete ;
Bruni, Laia ;
de Sanjose, Silvia ;
Saraiya, Mona ;
Ferlay, Jacques ;
Bray, Freddie .
LANCET GLOBAL HEALTH, 2020, 8 (02) :E191-E203
[4]  
Bohr A, 2020, Artificial intelligence in healthcare
[5]   Implementing Machine Learning in Health Care - Addressing Ethical Challenges [J].
Char, Danton S. ;
Shah, Nigam H. ;
Magnus, David .
NEW ENGLAND JOURNAL OF MEDICINE, 2018, 378 (11) :981-983
[6]  
Chen J., 2024, How a histogram works to display data
[7]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[8]   Cervical cancer [J].
Cohen, Paul A. ;
Jhingran, Anjua ;
Oaknin, Ana ;
Denny, Lynette .
LANCET, 2019, 393 (10167) :169-182
[9]   Additive logistic regression: A statistical view of boosting - Rejoinder [J].
Friedman, J ;
Hastie, T ;
Tibshirani, R .
ANNALS OF STATISTICS, 2000, 28 (02) :400-407
[10]   The value of CT, MRI, and PET-CT in detecting retropharyngeal lymph node metastasis of head and neck squamous cell carcinoma [J].
Kim, Jin Hwan ;
Choi, Kyu Young ;
Lee, Sang-Hyo ;
Lee, Dong Jin ;
Park, Bum Jung ;
Yoon, Dae Young ;
Rho, Young-Soo .
BMC MEDICAL IMAGING, 2020, 20 (01)