Performance analysis of different machine learning algorithms in breast cancer predictions

被引:0
作者
Battineni G. [1 ]
Chintalapudi N. [1 ]
Amenta F. [1 ]
机构
[1] Telemedicine and Telepharmacy Center, School of Medicinal and Health Products Sciences, University of Camerino, Camerino
来源
Battineni, Gopi (gopi.battineni@unicam.it) | 1600年 / European Alliance for Innovation卷 / 06期
关键词
Accuracy; AUC; Feature selection; Machine learning; Tumor classification;
D O I
10.4108/eai.28-5-2020.166010
中图分类号
学科分类号
摘要
INTRODUCTION: There is a great percentage of failures in clinical trials of early detection of breast cancer. To do this, machine learning (ML) algorithms are useful to do diagnosis and prediction of cancer tumors with better accuracy. OBJECTIVE: In this study, we develop an ML model coupled with limited features to produce high classification accuracy in tumor classification. METHODS: We considered a dataset of 569 females diagnosed as 212 malignant and 357 benign types. For model development, three supervised ML algorithms namely support vector machines (SVM), logistic regression (LR), and K-nearest neighbors (KNN) were employed. Each model was further validated by 10-fold cross-validation and performance measures were defined to evaluate the model outcomes. RESULTS: Both SVM and LR models generated 97.66% accuracy with total feature evaluation. With selective features, the SVM accuracy was improved by 98.25%. Whereas the LR model including limited features produced 100% of true positive predictions. CONCLUSION: The proposed models involved by selective features could improve the prediction accuracy of a breast cancer diagnosis. © 2020 Gopi Battineni et al., licensed to EAI.
引用
收藏
页码:1 / 7
页数:6
相关论文
共 27 条
[1]  
Siegel R., Ward E., Brawley O., Jemal A., Cancer Stastics, 2011, CA cancer J. Clin, (2011)
[2]  
Sun Y. S., Et al., Risk factors and preventions of breast cancer, International Journal of Biological Sciences, (2017)
[3]  
Breast Cancer Estimated Incidence, Mortality and Prevalence Worldwide in 2012, (2012)
[4]  
Berry D. A., Et al., Effect of screening and adjuvant therapy on mortality from breast cancer, N. Engl. J. Med, (2005)
[5]  
Folger P., Geospatial information and geographic information systems (GIS): Current issues and future challenges, Geospatial Information and GIS: Background and Issues, (2011)
[6]  
Asri H., Mousannif H., Al Moatassime H., Noel T., Using Machine Learning Algorithms for Breast Cancer Risk Prediction and Diagnosis, (2016)
[7]  
Battineni G., Sagaro G. G., Chinatalapudi N., Amenta F., Applications of machine learning predictive models in the chronic disease diagnosis, Journal of Personalized Medicine, (2020)
[8]  
Battineni G., Sagaro G. G., Nalini C., Amenta F., Tayebati S. K., Comparative Machine-Learning Approach: A Follow-Up Study on Type 2 Diabetes Predictions by Cross-Validation Methods, Machines, 7, 4, (2019)
[9]  
Kaur B., Sharma M., Mittal M., Verma A., Goyal L. M., Hemanth D. J., An improved salient object detection algorithm combining background and foreground connectivity for brain image analysis, Comput. Electr. Eng, (2018)
[10]  
Zhou X., Liu K. Y., Wong S. T. C., Cancer classification and prediction using logistic regression with Bayesian gene selection, J. Biomed. Inform, (2004)