Breast Cancer Prediction using Machine Learning Models

被引:0
作者
Iparraguirre-Villanueva, Orlando [1 ]
Epifania-Huerta, Andres [2 ]
Torres-Ceclen, Carmen [3 ]
Ruiz-Alvarado, John [4 ]
Cabanillas-Carbonell, Michael [5 ]
机构
[1] Univ Norbert Wiener, Fac Ingn & Negocios, Lima, Peru
[2] Univ Nacl San Martin, Fac Ingn Sistemas, Tarapoto, Peru
[3] Univ Catolica Angeles Chimbote, Fac Ingn, Chimbote, Peru
[4] Univ Tecnol Peru, Fac Ingn, Lima, Peru
[5] Univ Privada Norte, Fac Ingn, Lima, Peru
关键词
-Prediction; models; machine learning; cells; breast cancer;
D O I
10.14569/IJACSA.2023.0140272
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Breast cancer is a type of cancer that develops in the cells of the breast. Treatment for breast cancer usually involves X-ray, chemotherapy, or a combination of both treatments. Detecting cancer at an early stage can save a person's life. Artificial intelligence (AI) plays a very important role in this area. Therefore, predicting breast cancer remains a very challenging issue for clinicians and researchers. This work aims to predict the probability of breast cancer in patients. Using machine learning (ML) models such as Multilayer Perceptron (MLP), K-Nearest Neightbot (KNN), AdaBoost (AB), Bagging, Gradient Boosting (GB), and Random Forest (RF). The breast cancer diagnostic medical dataset from the Wisconsin repository has been used. The dataset includes 569 observations and 32 features. Following the data analysis methodology, data cleaning, exploratory analysis, training, testing, and validation were performed. The performance of the models was evaluated with the parameters: classification accuracy, specificity, sensitivity, F1 count, and precision. The training and results indicate that the six trained models can provide optimal classification and prediction results. The RF, GB, and AB models achieved 100% accuracy, outperforming the other models. Therefore, the suggested models for breast cancer identification, classification, and prediction are RF, GB, and AB. Likewise, the Bagging, KNN, and MLP models achieved a performance of 99.56%, 95.82%, and 96.92%, respectively. Similarly, the last three models achieved an optimal yield close to 100%. Finally, the results show a clear advantage of the RF, GB, and AB models, as they achieve more accurate results in breast cancer prediction.
引用
收藏
页码:610 / 620
页数:11
相关论文
共 46 条
[1]   Application of noise-reduction techniques to machine learning algorithms for breast cancer tumor identification [J].
Ahuja, Avani ;
Al-Zogbi, Lidia ;
Krieger, Axel .
COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 135
[2]   Age and Gender Classification Using Backpropagation and Bagging Algorithms [J].
Almomani, Ammar ;
Alweshah, Mohammed ;
Alomoush, Waleed ;
Alauthman, Mohammad ;
Jabai, Aseel ;
Abbass, Anwar ;
Hamad, Ghufran ;
Abdalla, Meral ;
Gupta, Brij B. .
CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (02) :3045-3062
[3]   Predicting the recurrence of breast cancer using machine learning algorithms [J].
Alzu'bi, Amal ;
Najadat, Hassan ;
Doulat, Wesam ;
Al-Shari, Osama ;
Zhou, Leming .
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (09) :13787-13800
[4]  
American Cancer Society, 2020, WHAT BREAST CANC
[5]   Multi-modal advanced deep learning architectures for breast cancer survival prediction [J].
Arya, Nikhilanand ;
Saha, Sriparna .
KNOWLEDGE-BASED SYSTEMS, 2021, 221
[7]   MTRRE-Net: A deep learning model for detection of breast cancer from histopathological images [J].
Chattopadhyay, Soham ;
Dey, Arijit ;
Singh, Pawan Kumar ;
Oliva, Diego ;
Cuevas, Erik ;
Sarkar, Ram .
COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 150
[8]   Analysis of the TP53 Deleterious Single Nucleotide Polymorphisms Impact on Estrogen Receptor Alpha-p53 Interaction: A Machine Learning Approach [J].
Chitrala, Kumaraswamy Naidu ;
Nagarkatti, Mitzi ;
Nagarkatti, Prakash ;
Yeguvapalli, Suneetha .
INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2019, 20 (12)
[9]   Breast Cancer Tumor Classification Using a Bag of Deep Multi-Resolution Convolutional Features [J].
Clement, David ;
Agu, Emmanuel ;
Obayemi, John ;
Adeshina, Steve ;
Soboyejo, Wole .
INFORMATICS-BASEL, 2022, 9 (04)
[10]   A hybrid approach for Bangla sign language recognition using deep transfer learning model with random forest classifier [J].
Das, Sunanda ;
Imtiaz, Md. Samir ;
Neom, Nieb Hasan ;
Siddique, Nazmul ;
Wang, Hui .
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213