Cervical Cancer Identification with Synthetic Minority Oversampling Technique and PCA Analysis using Random Forest Classifier

被引:0
作者
R. Geetha
S. Sivasubramanian
M. Kaliappan
S. Vimal
Suresh Annamalai
机构
[1] Bharath Institute of Higher Education and Research,Department of Computer Science and Engineering
[2] Mohamed Sathak A J Engineering College,Department of Information Technology
[3] Ramco Institute of Technology,Department of CSE
[4] National Engineering College,undefined
[5] Nehru Institute of Engineering and Technology,undefined
来源
Journal of Medical Systems | 2019年 / 43卷
关键词
Cervical cancer; Random Forest; PCA; RFE; SMOTE; RSOnto;
D O I
暂无
中图分类号
学科分类号
摘要
Cervical cancer is the fourth most communal malignant disease amongst women worldwide. In maximum circumstances, cervical cancer indications are not perceptible at its initial stages. There are a proportion of features that intensify the threat of emerging cervical cancer like human papilloma virus, sexual transmitted diseases, and smoking. Ascertaining those features and constructing a classification model to categorize, if the cases are cervical cancer or not is an existing challenging research. This learning intentions at using cervical cancer risk features to build classification model using Random Forest (RF) classification technique with the synthetic minority oversampling technique (SMOTE) and two feature reduction techniques recursive feature elimination and principle component analysis (PCA). Utmost medical data sets are frequently imbalanced since the number of patients is considerably fewer than the number of non-patients. For the imbalance of the used data set, SMOTE is cast-off to solve this problem. The data set comprises of 32 risk factors and four objective variables: Hinselmann, Schiller, Cytology and Biopsy. Accuracy, Sensitivity, Specificity, PPA and NPA of the four variables remains accurate after SMOTE when compared with values obtained before SMOTE. An RSOnto ontology has been created to visualize the progress in classification performance.
引用
收藏
相关论文
共 48 条
  • [1] Saha A(2010)Awareness of cervical cancer among female students of premier colleges in Kolkata, India Asian Paci c J. Cancer Prevention 11 1085 1090-313
  • [2] Chaudhury AN(2016)Cervical cancer: Sociode-mographic and clinical risk factors among adult Egyptian females J. Oncol. Res. Treat. 1 7-83
  • [3] Bhowmik P(2018)Cancer statistics, 2018 CA,Cancer J. Clin. 68 7 30-43
  • [4] Chatterjee R(2015)Enhancing secure routing in Mobile Ad Hoc Networks using a Dynamic Bayesian Signalling Game model Journal of Computers & Electrical Engineering 41 301-32
  • [5] El-Moselhy EA(2015)Development of a secure routing protocol using game theory model in mobile ad hoc networks Journal of Communications and Networks 17 75-undefined
  • [6] Borg HM(2016)Enhancing energy efficiency and load balancing in mobile ad hoc network using dynamic genetic algorithms Journal of Network and Computer Applications 73 35-undefined
  • [7] Atlam SA(2014)Application of machine learning to predict the recurrence-proneness for cervical cancer Neural Comput. Appl. 24 1311 1316-undefined
  • [8] Siegel RL(2014)A risk evaluation model of cervical cancer based on etiol-ogy and human leukocyte antigen allele susceptibility Int. J. InfectionDiseases 28 8 12-undefined
  • [9] Miller KD(2016)Cervical cancer stage prediction using decision tree approach of machine learning Int. J. Adv. Res. Comput. Commun. Eng. 5 345 348-undefined
  • [10] Jemal A(2017)Data-driven diagnosis of cervical cancer with support vector machine-based approaches IEEE Access 5 2239 2249-undefined