Breast Carcinoma Prediction Through Integration of Machine Learning Models

被引:0
作者
Martinez-Licort, Rosmeri [1 ]
Leon, Carlos de la Cruz [2 ,3 ]
Agarwal, Deevyankar [2 ]
Sahelices, Benjamin [1 ]
de la Torre, Isabel [2 ]
Miramontes-Gonzalez, Jose Pablo [4 ,5 ]
Amoon, Mohammed [6 ]
机构
[1] Univ Valladolid, Dept Comp Sci, GCME Res Grp, Valladolid 47011, Spain
[2] Univ Valladolid, Dept Signal Theory Commun & Telemat Engn, Valladolid 47011, Spain
[3] CARTIF Technol Ctr, Valladolid 47151, Spain
[4] Univ Valladolid, Fac Med, Dept Med, Valladolid 47005, Spain
[5] Rio Hortega Univ Hosp, Internal Med Serv, Valladolid 47012, Spain
[6] King Saud Univ, Community Coll, Dept Comp Sci, Riyadh 11437, Saudi Arabia
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Support vector machines; Breast cancer; Data models; Training; Accuracy; Principal component analysis; Analytical models; Ensemble learning; Machine learning; ensemble learning; machine learning; majority voting; principal component analysis; CANCER DIAGNOSIS; SCHEME;
D O I
10.1109/ACCESS.2024.3431998
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Breast cancer poses a global health challenge, with high incidence and mortality rates. Early detection and precise diagnosis are crucial for patient prognosis. Machine learning (ML) models applied to mammary biopsy image data hold promise for achieving an efficient and accurate breast cancer diagnosis. In this study, we evaluated the performance of several ML algorithms, including Logistic Regression (LR), Random Forest (RF), Naive Bayes (NB) and Support Vector Machine (SVM). We establish evaluation contexts by implementing data standardization and reducing the correlation between variables. Firstly, we select the best-performing parameters for each algorithm by building and evaluating the individual models. Then, we implement a combined model using weighted voting, where the weights of each model are determined based on its performance on the test dataset. The final model is constructed by combining the LR, RF and SVM models. We find that SVM is the best-performance individual model, so it has the highest weight in the final model. The final integrated model achieves an accuracy of 98%, a precision of 97%, a recall of 99%, an F1-score of 98% and an AUC of 0.98. Our weighted voting model compares favourably with the other models analysed. This approach demonstrates its efficiency and transparency in handling structured medical data. It is a prototype that will be refined and expanded to encompass larger real-world datasets.
引用
收藏
页码:134635 / 134650
页数:16
相关论文
共 50 条
  • [21] Prediction of Soil Compaction Parameters Using Machine Learning Models
    Li, Bingyi
    You, Zixuan
    Ni, Kaiwei
    Wang, Yuexiang
    APPLIED SCIENCES-BASEL, 2024, 14 (07):
  • [22] Predicting Breast Cancer Relapse from Histopathological Images with Ensemble Machine Learning Models
    Sahoo, Ghanashyam
    Nayak, Ajit Kumar
    Tripathy, Pradyumna Kumar
    Panigrahi, Amrutanshu
    Pati, Abhilash
    Sahu, Bibhuprasad
    Mahanty, Chandrakanta
    Mallik, Saurav
    CURRENT ONCOLOGY, 2024, 31 (11) : 6577 - 6597
  • [23] Improved prediction of gene expression through integrating cell signalling models with machine learning
    Al Taweraqi, Nada
    King, Ross D.
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [24] Building Better Models: Prediction, Replication, and Machine Learning in the Social Sciences
    Hindman, Matthew
    ANNALS OF THE AMERICAN ACADEMY OF POLITICAL AND SOCIAL SCIENCE, 2015, 659 (01) : 48 - 62
  • [25] Machine Learning and Deep Learning for Loan Prediction in Banking: Exploring Ensemble Methods and Data Balancing
    Sayed, Eslam Hussein
    Alabrah, Amerah
    Rahouma, Kamel Hussein
    Zohaib, Muhammad
    Badry, Rasha M.
    IEEE ACCESS, 2024, 12 : 193997 - 194019
  • [26] Comparative Analysis of Machine Learning Models for Performance Prediction of the SPEC Benchmarks
    Tousi, Ashkan
    Lujan, Mikel
    IEEE ACCESS, 2022, 10 : 11994 - 12011
  • [27] Machine learning techniques for personalized breast cancer risk prediction: comparison with the BCRAT and BOADICEA models
    Ming, Chang
    Viassolo, Valeria
    Probst-Hensch, Nicole
    Chappuis, Pierre O.
    Dinov, Ivo D.
    Katapodi, Maria C.
    BREAST CANCER RESEARCH, 2019, 21 (1)
  • [28] The Application and Comparison of Machine Learning Models for the Prediction of Breast Cancer Prognosis: Retrospective Cohort Study
    Xiao, Jialong
    Mo, Miao
    Wang, Zezhou
    Zhou, Changming
    Shen, Jie
    Yuan, Jing
    He, Yulian
    Zheng, Ying
    JMIR MEDICAL INFORMATICS, 2022, 10 (02)
  • [29] Machine learning models in breast cancer survival prediction
    Montazeri, Mitra
    Montazeri, Mohadeseh
    Montazeri, Mahdieh
    Beigzadeh, Amin
    TECHNOLOGY AND HEALTH CARE, 2016, 24 (01) : 31 - 42
  • [30] Machine learning techniques for personalized breast cancer risk prediction: comparison with the BCRAT and BOADICEA models
    Chang Ming
    Valeria Viassolo
    Nicole Probst-Hensch
    Pierre O. Chappuis
    Ivo D. Dinov
    Maria C. Katapodi
    Breast Cancer Research, 21