Enhancing fairness in breast cancer recurrence prediction through temporal machine learning models

被引:0
|
作者
Sundus, Katrina I. [1 ]
Hammo, Bassam H. [1 ,2 ]
Al-Zoubi, Mohammad B. [1 ]
机构
[1] King Abdullah II School of Information Technology, The University of Jordan, Amman
[2] King Hussein School of Computing Sciences, Princess Sumaya University for Technology, Amman
关键词
Breast cancer recurrence; Ensemble learning; SMOTE; Temporal data; Under-sampling;
D O I
10.1007/s00521-024-10407-8
中图分类号
学科分类号
摘要
Breast cancer recurrence prediction is a significant challenge in oncology. Advanced methodologies are required to improve prediction accuracy and clinical decision-making. This study presents a novel approach to breast cancer recurrence prediction by integrating machine learning techniques and a hybrid data mining methodology incorporating a temporal dimension into dataset derivation. Our research is based on the Jordan Breast Cancer Dataset (JBRCA), which includes over 44,000 cases spanning 15 years collected from the King Hussein Cancer Center’s registry database in Amman, Jordan. The proposed methodology encompasses data understanding, preparation, and model development stages. We use a thorough data preparation process involving multicollinearity feature selection, feature scaling, and strategic sampling to address dataset challenges. Moreover, we introduce a temporal-derived dataset strategy, dividing the data into four distinct time intervals to capture evolving characteristics and optimize model relevance. We employ diverse base classifiers and ensemble methods to enhance predictive performance in model development. We use evaluation metrics such as accuracy, recall, specificity, G-mean, and ROC-AUC to assess model efficacy across temporal intervals. Our experimental findings reveal significant impacts on classifier performance with temporal dataset derivation, with notable strengths observed in specific classifiers and temporal intervals. For instance, the Naive Bayes model demonstrates efficacy in identifying recurrence cases, while logistic regression exhibits robust performance in ROC-AUC and G-mean metrics. Our study contributes to breast cancer recurrence prediction by introducing a novel methodology that addresses dataset challenges and leverages temporal insights for enhanced predictive accuracy. The findings have a direct impact on clinical practice, providing valuable tools for early detection and improved therapy planning. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.
引用
收藏
页码:22697 / 22718
页数:21
相关论文
共 50 条
  • [21] Machine learning models for chronic kidney disease diagnosis and prediction
    Rahman, Md. Mustafizur
    Al-Amin, Md.
    Hossain, Jahangir
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 87
  • [22] Prediction of Soil Compaction Parameters Using Machine Learning Models
    Li, Bingyi
    You, Zixuan
    Ni, Kaiwei
    Wang, Yuexiang
    APPLIED SCIENCES-BASEL, 2024, 14 (07):
  • [23] A hybrid multi-stage learning technique based on brain storming optimization algorithm for breast cancer recurrence prediction
    Alwohaibi, Maram
    Alzaqebah, Malek
    Alotaibi, Noura M.
    Alzahrani, Abeer M.
    Zouch, Mariem
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (08) : 5192 - 5203
  • [24] Enhancing Breast Cancer Detection and Classification Using Advanced Multi-Model Features and Ensemble Machine Learning Techniques
    Al Reshan, Mana Saleh
    Amin, Samina
    Zeb, Muhammad Ali
    Sulaiman, Adel
    Alshahrani, Hani
    Azar, Ahmad Taher
    Shaikh, Asadullah
    LIFE-BASEL, 2023, 13 (10):
  • [25] Early Prediction of Breast Cancer Recurrence for Patients Treated with Neoadjuvant Chemotherapy: A Transfer Learning Approach on DCE-MRIs
    Comes, Maria Colomba
    La Forgia, Daniele
    Didonna, Vittorio
    Fanizzi, Annarita
    Giotta, Francesco
    Latorre, Agnese
    Martinelli, Eugenio
    Mencattini, Arianna
    Paradiso, Angelo Virgilio
    Tamborra, Pasquale
    Terenzio, Antonella
    Zito, Alfredo
    Lorusso, Vito
    Massafra, Raffaella
    CANCERS, 2021, 13 (10)
  • [26] Building Better Models: Prediction, Replication, and Machine Learning in the Social Sciences
    Hindman, Matthew
    ANNALS OF THE AMERICAN ACADEMY OF POLITICAL AND SOCIAL SCIENCE, 2015, 659 (01) : 48 - 62
  • [27] Quality prediction through machine learning for the inspection and manufacturing process of blood glucose test strips
    Tsou, Ching-Shih
    Liou, Christine
    Cheng, Longsheng
    Zhou, Hanting
    COGENT ENGINEERING, 2022, 9 (01):
  • [28] Performance Analysis of Machine Learning Centered Workload Prediction Models for Cloud
    Saxena, Deepika
    Kumar, Jitendra
    Singh, Ashutosh Kumar
    Schmid, Stefan
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2023, 34 (04) : 1313 - 1330
  • [29] Ensemble machine learning models for prediction of flyrock due to quarry blasting
    Barkhordari, M. S.
    Armaghani, D. J.
    Fakharian, P.
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL SCIENCE AND TECHNOLOGY, 2022, 19 (09) : 8661 - 8676
  • [30] Ensemble machine learning models for prediction of flyrock due to quarry blasting
    M. S. Barkhordari
    D. J. Armaghani
    P. Fakharian
    International Journal of Environmental Science and Technology, 2022, 19 : 8661 - 8676