Data Driven Prognosis of Cervical Cancer Using ClassBalancing and Machine Learning Techniques

被引:3
作者
Arora M. [1 ,2 ]
Dhawan S. [1 ,3 ]
Singh K. [1 ,3 ]
机构
[1] Department of Computer Science & Engineering, University Institute of Engineering & Technology (U.I.E.T), Kurukshetra University, Kurukshetra
[2] Department of Computer Science and Technology, Manav Rachna University, Faridabad
[3] Faculty of Computer Science & Engineering, Department of Computer Science & Engineering, University Institute of Engineering & Technology (U.I.E.T), Kurukshetra University, Kurukshetra
关键词
Cervical Cancer; K-Nearest Neighbour; Random forest; Random over-sampling; random under-sampling; SMOTE; Support vector machine;
D O I
10.4108/eai.13-7-2018.164264
中图分类号
学科分类号
摘要
INTRODUCTION: With the progression of innovation and its joint effort with health care services, the world has achieved a lot of benefits. AI procedures and machine learning techniques are constantly improving existing statistical methods for better results in the medical field. These improved methods will assist health care providers in providing intelligent medical services. OBJECTIVES: This Cervical cancer is the fourth most common cancer among the other female cancers. This cancer is preventable with early diagnosis. This reason becomes the motivation of the research work. For efficiently and timely prognosis of cervical cancer require a computer-assisted algorithm METHODS: The work demonstrated in this paper contributes to the techniques of machine learning for diagnosing cervical cancer. The machine learning algorithms used in this research are K Nearest Neighbour, Support Vector Machine and Random Forest Tree. These classification algorithms are used with class balancing techniques including under-sampling, Oversampling and SMOTE. RESULTS: The evaluation metrics used for comparative analysis includes accuracy, sensitivity, specificity, negative predicted accuracy, and positive predictive accuracy. The results show the Random Forest algorithm with SMOTE technique delivered more promising results over SVM and KNN for four target variables Schiller, Biopsy, Hinselmann, and Cytology respectively. CONCLUSION: It is concluded that with the limited amount of data which also suffers from the unbalancing problem the promising results drawn using the proposed model. ©2020 Mamta Arora et al., licensed to EAI. This is an open access article distributed under the terms of theCreative Commons Attribution licence (http://creativecommons.org/licenses/by/3.0/), which permits unlimited use, distribution and reproduction in any medium so long as the original work is properly cited.
引用
收藏
页码:1 / 9
页数:8
相关论文
共 28 条
  • [1] Fernandes Kelwin, Cardoso Jaime S., Fernandes Jessica, Transfer Learning with Partial Observability Applied to Cervical Cancer Screening, Proceedings of Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA), pp. 243-250, (2017)
  • [2] Yang Wenying, Gou Xin, Xu Tongqing, Yi Xiping, Jiang Maohong, Cervical Cancer Risk Prediction Model and Analysis of Risk Factors based on Machine Learning, Proceedings of 11th International Conference on Bioinformatics and Biomedical Technology, pp. 50-54, (2019)
  • [3] Parikh D., Menon V., Machine Learning Applied to Cervical Cancer Data, International Journal of Mathematical Sciences and Computing, 5, 1, pp. 53-64, (2019)
  • [4] Wu W., Zhou H., Data-driven diagnosis of cervical cancer with support vector machine-based approaches, IEEE Access, 5, pp. 25189-25195, (2017)
  • [5] Sarwar A., Ali M., Sharma V., Performance Evaluation of Machine Learning Techniques for Screening of Cervical Cancer, 2nd International Conference on Computing for Sustainable Global Development (INDIACom), pp. 2297-2303, (2015)
  • [6] Hemalatha K., Rani U., An Optimal Neural Network Classifier for Cervical Pap smear Data, 7th International Advance Computing Conference(IACC), pp. 110-114, (2017)
  • [7] Latha D.S., Lakshmi P., Fathima S., Staging Prediction in Cervical Cancer Patients-A Machine Learning Approach, International Journal of Innovative research and Practices, 2, 2, pp. 14-23, (2014)
  • [8] Vidya R., Nasira G., Prediction of Cervical Cancer using Hybrid Induction Technique: A Solution for Human Hereditary Disease Patterns, Indian Journal of Science & Technology, 9, 30, pp. 1-10, (2016)
  • [9] Kaur Prableen, Sharma Manik, Diagnosis of Human Psychological Disorders using Supervised Learning and Nature-Inspired Computing Techniques: A MetaAnalysis, Journal of medical systems, 7, 43, (2019)
  • [10] Manogaran Gunasekaran, Et al., Machine learning-based big data processing framework for cancer diagnosis using hidden Markov model and GM clustering, Wireless personal communications, 3, 102, pp. 2099-2116, (2018)