Comparative Analysis of Breast and Prostate Cancer Prediction Using Machine Learning Techniques

被引:1
作者
Rani, Samta [1 ]
Ahmad, Tanvir [1 ]
Masood, Sarfaraz [1 ]
机构
[1] Jamia Millia Islamia, CSE, Delhi, India
来源
INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING AND COMMUNICATIONS, ICICC 2022, VOL 1 | 2023年 / 473卷
关键词
Breast cancer; Prostate cancer; Feature selection; Normalization; Classifications;
D O I
10.1007/978-981-19-2821-5_54
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Around the whole world, cancer is the most life-threatening disease. Basically, cancer can arise in any tissue of the body, and while each variety of cancer has unique characteristics, the fundamental processes that might cause cancer are highly common in all disease types. Breast cancer is one of the most ubiquitous types of cancer in females. In males, prostate cancer is the most dangerous during recent years. This study focuses on breast cancer as well as on prostate cancer in the direction of their early predictions. For early prediction, eight classification models had been used such as logistic regression (LR), Naive Bayes (NB), decision tree (DT), stochastic gradient descent (SGD), K-nearest neighbors (KNN), decision tree (DT), random forest (RF), support vector machine (SVM), and artificial neural network (ANN). This work includes three different datasets for research analysis of breast and prostate cancer predictions. Two datasets for breast cancer (Coimbra and Wisconsin) and one for prostate cancer are taken from UCI and Kaggle repository, respectively. For improving the results of prediction, the normalization technique and feature selection method had been used in this paper. Performance in terms of accuracy, precision, recall, Fl-score, and curves of each classifier are analyzed in this study. Most of the classifiers did well after using the feature selection method (ANOVA). In the case of Breast Cancer Coimbra, KNN give good results with 80% accuracy in both the cases with or without using feature selection. Logistic regression with feature selection doing the best work on Wisconsin Breast Cancer with 99% accuracy. There are four classifiers (SVM, RF, DT, and SGD) which gives highest accuracy (97%) on prostate cancer.
引用
收藏
页码:643 / 650
页数:8
相关论文
共 15 条
[1]   Using Machine Learning Algorithms for Breast Cancer Risk Prediction and Diagnosis [J].
Asri, Hiba ;
Mousannif, Hajar ;
Al Moatassime, Hassan ;
Noel, Thomas .
7TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT 2016) / THE 6TH INTERNATIONAL CONFERENCE ON SUSTAINABLE ENERGY INFORMATION TECHNOLOGY (SEIT-2016) / AFFILIATED WORKSHOPS, 2016, 83 :1064-1069
[2]   Age-specific survival in prostate cancer using machine learning [J].
Doja, M. N. ;
Kaur, Ishleen ;
Ahmad, Tanvir .
DATA TECHNOLOGIES AND APPLICATIONS, 2020, 54 (02) :215-234
[3]   Cross-cancer Prediction: A Novel Machine Learning Approach to Discover Molecular Targets for Development of Treatments for Multiple Cancers [J].
Gao, Katie ;
Wang, Dayong ;
Huang, Yi .
CANCER INFORMATICS, 2018, 17
[4]   Machine learning applications in cancer prognosis and prediction [J].
Kourou, Konstantina ;
Exarchos, Themis P. ;
Exarchos, Konstantinos P. ;
Karamouzis, Michalis V. ;
Fotiadis, Dimitrios I. .
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2015, 13 :8-17
[5]  
Masood S, 2017, 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND AUTOMATION (ICCCA), P1183, DOI 10.1109/CCAA.2017.8229977
[6]  
Mehdi M, 2019, PROCEEDINGS OF THE 2019 8TH INTERNATIONAL CONFERENCE ON SYSTEM MODELING & ADVANCEMENT IN RESEARCH TRENDS (SMART-2019), P155, DOI [10.1109/smart46866.2019.9117466, 10.1109/SMART46866.2019.9117466]
[7]  
Mushtaq Z, 2019, 2019 INT C ENG EM TE, P1, DOI DOI 10.1109/CEET1.2019.8711868
[8]  
Polat K, 2018, 2018 2ND INTERNATIONAL SYMPOSIUM ON MULTIDISCIPLINARY STUDIES AND INNOVATIVE TECHNOLOGIES (ISMSIT), P315
[9]  
Prabadevi B, 2020, 2020 INT C EM TRENDS, DOI [DOI 10.1109/IC-ETITE47903.2020.36, 10.1109/ic-ETITE47903.2020.36, 10.1109/IC-ETITE47903.2020.36]
[10]   Machine Learning Based Computer Aided Diagnosis of Breast Cancer Utilizing Anthropometric and Clinical Features [J].
Rahman, M. M. ;
Ghasemi, Y. ;
Suley, E. ;
Zhou, Y. ;
Wang, S. ;
Rogers, J. .
IRBM, 2021, 42 (04) :215-226