Predictive Analysis of Cervical Cancer Using Machine Learning Techniques

被引:0
作者
Kumawat, Gaurav [1 ]
Vishwakarma, Santosh Kumar [1 ]
Chakrabarti, Prasun [2 ]
机构
[1] Manipal Univ Jaipur, Dept Comp Sci & Engn, Jaipur 303007, Rajasthan, India
[2] Sir Padampat Singhania Univ, Dept Comp Sci & Engn, Udaipur 313601, Rajasthan, India
来源
SMART TRENDS IN COMPUTING AND COMMUNICATIONS, VOL 1, SMARTCOM 2024 | 2024年 / 945卷
关键词
Cervical cancer; Prediction; Machine learning; SMOTE; Voting; PROGNOSIS;
D O I
10.1007/978-981-97-1320-2_40
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Precautionary measures are less expensive than medical therapy in nearly every country. Any disease that is discovered early on has a higher probability of successfully treating its patient than one that is discovered later in its development. Any therapy we could provide them with would be helpful and would make their lives more pleasant if we did not know how to cure them. Cervical cancer is one of these diseases; it is the fourth-most common kind among women globally. The use of hormonal contraceptives and age are two of the numerous variables that raise the risk of cervical cancer. Cervical cancer mortality rates decrease, and recovery rates are increased with early diagnosis. The goal of this study is to develop a model that can sensitively and correctly detect cervical cancer using machine learning techniques. The voting mechanism will be used that integrates three classifiers logistic regression, decision tree, and random forest. The imbalanced dataset issue was resolved by using SMOTE in conjunction with principal component analysis (PCA) to eliminate dimensions that have no bearing on model accuracy. Next, to avoid the overfitting issue, a stratified tenfold cross-validation procedure was employed. The four target variables in this dataset-Hinselmann, Cytology, Schiller, and Biopsy-are linked to 32 risk factors. For each of the four target variables, we discovered that applying the voting classifier, SMOTE, and PCA approaches helped increase the prediction models' accuracy, ROC-AUC, and sensitivity to greater rates. Accuracy, PPA, and sensitivity ratios increased in the SMOTE-voting model for all target variables by 2.45-5.74%, 2.33-26.84%, and 33.98-42.54%, respectively.
引用
收藏
页码:501 / 516
页数:16
相关论文
共 33 条
[1]   Cervical Cancer Diagnosis Using Random Forest Classifier With SMOTE and Feature Reduction Techniques [J].
Abdoh, Sherif F. ;
Rizka, Mohamed Abo ;
Maghraby, Fahima A. .
IEEE ACCESS, 2018, 6 :59475-59485
[2]   Comparative analysis on Prediction Models with various Data Preprocessings in the Prognosis of Cervical Cancer [J].
Abdullah ;
Bin Ashraf, Faisal ;
Momo, Nusrat Suzana .
2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
[3]  
Akyol Kemal, 2018, International Journal of Information Engineering and Electronic Business, V10, P1, DOI 10.5815/ijieeb.2018.05.01
[4]  
Al-Eid C, 2010, BDS, DFE, "cancer incidence report Saudi Arabia 2010"
[5]  
Alam TM, 2019, INT J ADV COMPUT SC, V10, P388
[6]  
Alrawaji WA, 2015, Cancer incidence report
[7]  
[Anonymous], 2018, Global Cancer Observatory
[8]  
[Anonymous], 2004, ROC graphs: Notes and practical considerations for researchers
[9]  
Boateng E. Y., 2019, J Data Anal Inf Process, V7, P190, DOI [10.4236/jdaip.2019.74012, DOI 10.4236/JDAIP.2019.74012]
[10]   A Genetic Programming-Based Imputation Method for Classification with Missing Data [J].
Cao Truong Tran ;
Zhang, Mengjie ;
Andreae, Peter .
GENETIC PROGRAMMING, EUROGP 2016, 2016, 9594 :149-163