Breast Carcinoma Prediction Through Integration of Machine Learning Models

被引：0

作者：

Martinez-Licort, Rosmeri ^{[1
]}

Leon, Carlos de la Cruz ^{[2
,3
]}

Agarwal, Deevyankar ^{[2
]}

Sahelices, Benjamin ^{[1
]}

de la Torre, Isabel ^{[2
]}

Miramontes-Gonzalez, Jose Pablo ^{[4
,5
]}

Amoon, Mohammed ^{[6
]}

机构：

[1] Univ Valladolid, Dept Comp Sci, GCME Res Grp, Valladolid 47011, Spain

[2] Univ Valladolid, Dept Signal Theory Commun & Telemat Engn, Valladolid 47011, Spain

[3] CARTIF Technol Ctr, Valladolid 47151, Spain

[4] Univ Valladolid, Fac Med, Dept Med, Valladolid 47005, Spain

[5] Rio Hortega Univ Hosp, Internal Med Serv, Valladolid 47012, Spain

[6] King Saud Univ, Community Coll, Dept Comp Sci, Riyadh 11437, Saudi Arabia

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Support vector machines; Breast cancer; Data models; Training; Accuracy; Principal component analysis; Analytical models; Ensemble learning; Machine learning; ensemble learning; machine learning; majority voting; principal component analysis; CANCER DIAGNOSIS; SCHEME;

D O I：

10.1109/ACCESS.2024.3431998

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Breast cancer poses a global health challenge, with high incidence and mortality rates. Early detection and precise diagnosis are crucial for patient prognosis. Machine learning (ML) models applied to mammary biopsy image data hold promise for achieving an efficient and accurate breast cancer diagnosis. In this study, we evaluated the performance of several ML algorithms, including Logistic Regression (LR), Random Forest (RF), Naive Bayes (NB) and Support Vector Machine (SVM). We establish evaluation contexts by implementing data standardization and reducing the correlation between variables. Firstly, we select the best-performing parameters for each algorithm by building and evaluating the individual models. Then, we implement a combined model using weighted voting, where the weights of each model are determined based on its performance on the test dataset. The final model is constructed by combining the LR, RF and SVM models. We find that SVM is the best-performance individual model, so it has the highest weight in the final model. The final integrated model achieves an accuracy of 98%, a precision of 97%, a recall of 99%, an F1-score of 98% and an AUC of 0.98. Our weighted voting model compares favourably with the other models analysed. This approach demonstrates its efficiency and transparency in handling structured medical data. It is a prototype that will be refined and expanded to encompass larger real-world datasets.

引用

页码：134635 / 134650

页数：16

共 50 条

[21] Prediction of Soil Compaction Parameters Using Machine Learning Models
Li, Bingyi
You, Zixuan
Ni, Kaiwei
Wang, Yuexiang
APPLIED SCIENCES-BASEL, 2024, 14 (07):
[22] Predicting Breast Cancer Relapse from Histopathological Images with Ensemble Machine Learning Models
Sahoo, Ghanashyam
Nayak, Ajit Kumar
Tripathy, Pradyumna Kumar
Panigrahi, Amrutanshu
Pati, Abhilash
Sahu, Bibhuprasad
Mahanty, Chandrakanta
Mallik, Saurav
CURRENT ONCOLOGY, 2024, 31 (11) : 6577 - 6597
[23] Improved prediction of gene expression through integrating cell signalling models with machine learning
Al Taweraqi, Nada
King, Ross D.
BMC BIOINFORMATICS, 2022, 23 (01)
[24] Building Better Models: Prediction, Replication, and Machine Learning in the Social Sciences
Hindman, Matthew
ANNALS OF THE AMERICAN ACADEMY OF POLITICAL AND SOCIAL SCIENCE, 2015, 659 (01) : 48 - 62
[25] Machine Learning and Deep Learning for Loan Prediction in Banking: Exploring Ensemble Methods and Data Balancing
Sayed, Eslam Hussein
Alabrah, Amerah
Rahouma, Kamel Hussein
Zohaib, Muhammad
Badry, Rasha M.
IEEE ACCESS, 2024, 12 : 193997 - 194019
[26] Comparative Analysis of Machine Learning Models for Performance Prediction of the SPEC Benchmarks
Tousi, Ashkan
Lujan, Mikel
IEEE ACCESS, 2022, 10 : 11994 - 12011
[27] Machine learning techniques for personalized breast cancer risk prediction: comparison with the BCRAT and BOADICEA models
Ming, Chang
Viassolo, Valeria
Probst-Hensch, Nicole
Chappuis, Pierre O.
Dinov, Ivo D.
Katapodi, Maria C.
BREAST CANCER RESEARCH, 2019, 21 (1)
[28] The Application and Comparison of Machine Learning Models for the Prediction of Breast Cancer Prognosis: Retrospective Cohort Study
Xiao, Jialong
Mo, Miao
Wang, Zezhou
Zhou, Changming
Shen, Jie
Yuan, Jing
He, Yulian
Zheng, Ying
JMIR MEDICAL INFORMATICS, 2022, 10 (02)
[29] Machine learning models in breast cancer survival prediction
Montazeri, Mitra
Montazeri, Mohadeseh
Montazeri, Mahdieh
Beigzadeh, Amin
TECHNOLOGY AND HEALTH CARE, 2016, 24 (01) : 31 - 42
[30] Machine learning techniques for personalized breast cancer risk prediction: comparison with the BCRAT and BOADICEA models
Chang Ming
Valeria Viassolo
Nicole Probst-Hensch
Pierre O. Chappuis
Ivo D. Dinov
Maria C. Katapodi
Breast Cancer Research, 21

← 1 2 3 4 5 →