Breast Carcinoma Prediction Through Integration of Machine Learning Models

被引:0
|
作者
Martinez-Licort, Rosmeri [1 ]
Leon, Carlos de la Cruz [2 ,3 ]
Agarwal, Deevyankar [2 ]
Sahelices, Benjamin [1 ]
de la Torre, Isabel [2 ]
Miramontes-Gonzalez, Jose Pablo [4 ,5 ]
Amoon, Mohammed [6 ]
机构
[1] Univ Valladolid, Dept Comp Sci, GCME Res Grp, Valladolid 47011, Spain
[2] Univ Valladolid, Dept Signal Theory Commun & Telemat Engn, Valladolid 47011, Spain
[3] CARTIF Technol Ctr, Valladolid 47151, Spain
[4] Univ Valladolid, Fac Med, Dept Med, Valladolid 47005, Spain
[5] Rio Hortega Univ Hosp, Internal Med Serv, Valladolid 47012, Spain
[6] King Saud Univ, Community Coll, Dept Comp Sci, Riyadh 11437, Saudi Arabia
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Support vector machines; Breast cancer; Data models; Training; Accuracy; Principal component analysis; Analytical models; Ensemble learning; Machine learning; ensemble learning; machine learning; majority voting; principal component analysis; CANCER DIAGNOSIS; SCHEME;
D O I
10.1109/ACCESS.2024.3431998
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Breast cancer poses a global health challenge, with high incidence and mortality rates. Early detection and precise diagnosis are crucial for patient prognosis. Machine learning (ML) models applied to mammary biopsy image data hold promise for achieving an efficient and accurate breast cancer diagnosis. In this study, we evaluated the performance of several ML algorithms, including Logistic Regression (LR), Random Forest (RF), Naive Bayes (NB) and Support Vector Machine (SVM). We establish evaluation contexts by implementing data standardization and reducing the correlation between variables. Firstly, we select the best-performing parameters for each algorithm by building and evaluating the individual models. Then, we implement a combined model using weighted voting, where the weights of each model are determined based on its performance on the test dataset. The final model is constructed by combining the LR, RF and SVM models. We find that SVM is the best-performance individual model, so it has the highest weight in the final model. The final integrated model achieves an accuracy of 98%, a precision of 97%, a recall of 99%, an F1-score of 98% and an AUC of 0.98. Our weighted voting model compares favourably with the other models analysed. This approach demonstrates its efficiency and transparency in handling structured medical data. It is a prototype that will be refined and expanded to encompass larger real-world datasets.
引用
收藏
页码:134635 / 134650
页数:16
相关论文
共 50 条
  • [1] Breast Cancer Prediction using Machine Learning Models
    Iparraguirre-Villanueva, Orlando
    Epifania-Huerta, Andres
    Torres-Ceclen, Carmen
    Ruiz-Alvarado, John
    Cabanillas-Carbonell, Michael
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (02) : 610 - 620
  • [2] Enhancing Rice Production Prediction in Indonesia Using Advanced Machine Learning Models
    Erlin
    Yunianta, Arda
    Wulandhari, Lili Ayu
    Desnelita, Yenny
    Nasution, Nurliana
    Junadhi
    IEEE ACCESS, 2024, 12 : 151161 - 151177
  • [3] Explainability of Machine Learning Models for Bankruptcy Prediction
    Park, Min Sue
    Son, Hwijae
    Hyun, Chongseok
    Hwang, Hyung Ju
    IEEE ACCESS, 2021, 9 : 124887 - 124899
  • [4] Enhancing fairness in breast cancer recurrence prediction through temporal machine learning models
    Sundus, Katrina I.
    Hammo, Bassam H.
    Al-Zoubi, Mohammad B.
    Neural Computing and Applications, 2024, 36 (36) : 22697 - 22718
  • [5] Analysis of breast cancer prediction and visualisation using machine learning models
    Magesh G.
    Swarnalatha P.
    International Journal of Cloud Computing, 2022, 11 (01) : 43 - 60
  • [6] Design of an Early Prediction Model for Parkinson's Disease Using Machine Learning
    Velu, K.
    Jaisankar, N.
    IEEE ACCESS, 2025, 13 : 17457 - 17472
  • [7] Osteoporosis, fracture and survival: Application of machine learning in breast cancer prediction models
    Ji, Lichen
    Zhang, Wei
    Zhong, Xugang
    Zhao, Tingxiao
    Sun, Xixi
    Zhu, Senbo
    Tong, Yu
    Luo, Junchao
    Xu, Youjia
    Yang, Di
    Kang, Yao
    Wang, Jin
    Bi, Qing
    FRONTIERS IN ONCOLOGY, 2022, 12
  • [8] Machine learning-based models for the prediction of breast cancer recurrence risk
    Zuo, Duo
    Yang, Lexin
    Jin, Yu
    Qi, Huan
    Liu, Yahui
    Ren, Li
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2023, 23 (01)
  • [9] Application of Machine Learning in Animal Disease Analysis and Prediction
    Zhang, Shuwen
    Su, Qiang
    Chen, Qin
    CURRENT BIOINFORMATICS, 2021, 16 (07) : 972 - 982
  • [10] Machine learning models and bankruptcy prediction
    Barboza, Flavio
    Kimura, Herbert
    Altman, Edward
    EXPERT SYSTEMS WITH APPLICATIONS, 2017, 83 : 405 - 417