Enhancing fairness in breast cancer recurrence prediction through temporal machine learning models

被引:0
|
作者
Sundus, Katrina I. [1 ]
Hammo, Bassam H. [1 ,2 ]
Al-Zoubi, Mohammad B. [1 ]
机构
[1] King Abdullah II School of Information Technology, The University of Jordan, Amman
[2] King Hussein School of Computing Sciences, Princess Sumaya University for Technology, Amman
关键词
Breast cancer recurrence; Ensemble learning; SMOTE; Temporal data; Under-sampling;
D O I
10.1007/s00521-024-10407-8
中图分类号
学科分类号
摘要
Breast cancer recurrence prediction is a significant challenge in oncology. Advanced methodologies are required to improve prediction accuracy and clinical decision-making. This study presents a novel approach to breast cancer recurrence prediction by integrating machine learning techniques and a hybrid data mining methodology incorporating a temporal dimension into dataset derivation. Our research is based on the Jordan Breast Cancer Dataset (JBRCA), which includes over 44,000 cases spanning 15 years collected from the King Hussein Cancer Center’s registry database in Amman, Jordan. The proposed methodology encompasses data understanding, preparation, and model development stages. We use a thorough data preparation process involving multicollinearity feature selection, feature scaling, and strategic sampling to address dataset challenges. Moreover, we introduce a temporal-derived dataset strategy, dividing the data into four distinct time intervals to capture evolving characteristics and optimize model relevance. We employ diverse base classifiers and ensemble methods to enhance predictive performance in model development. We use evaluation metrics such as accuracy, recall, specificity, G-mean, and ROC-AUC to assess model efficacy across temporal intervals. Our experimental findings reveal significant impacts on classifier performance with temporal dataset derivation, with notable strengths observed in specific classifiers and temporal intervals. For instance, the Naive Bayes model demonstrates efficacy in identifying recurrence cases, while logistic regression exhibits robust performance in ROC-AUC and G-mean metrics. Our study contributes to breast cancer recurrence prediction by introducing a novel methodology that addresses dataset challenges and leverages temporal insights for enhanced predictive accuracy. The findings have a direct impact on clinical practice, providing valuable tools for early detection and improved therapy planning. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.
引用
收藏
页码:22697 / 22718
页数:21
相关论文
共 50 条
  • [1] Breast Carcinoma Prediction Through Integration of Machine Learning Models
    Martinez-Licort, Rosmeri
    Leon, Carlos de la Cruz
    Agarwal, Deevyankar
    Sahelices, Benjamin
    de la Torre, Isabel
    Miramontes-Gonzalez, Jose Pablo
    Amoon, Mohammed
    IEEE ACCESS, 2024, 12 : 134635 - 134650
  • [2] Ensemble learning method for the prediction of breast cancer recurrence
    Almuhaidib, Daad Abdullah
    Shaiba, Hadil Ahmed
    Alharbi, Najla Ghazi
    Alotaibi, Sara Muhammad
    Albusayyis, Fatima Moteb
    Alzaid, Mashael Abdulalim
    Almadhi, Reem Mohammed
    2018 1ST INTERNATIONAL CONFERENCE ON COMPUTER APPLICATIONS & INFORMATION SECURITY (ICCAIS' 2018), 2018,
  • [3] Enhancing Machine Learning based QoE Prediction by Ensemble Models
    Casas, Pedro
    Seufert, Michael
    Wehner, Nikolas
    Schwind, Anika
    Wamser, Florian
    2018 IEEE 38TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS), 2018, : 1642 - 1647
  • [4] Ensemble Machine Learning Models for Breast Cancer Identification
    Dritsas, Elias
    Trigka, Maria
    Mylonas, Phivos
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS. AIAI 2023 IFIP WG 12.5 INTERNATIONAL WORKSHOPS, 2023, 677 : 303 - 311
  • [5] Predicting Breast Cancer Recurrence Using Machine Learning Techniques: A Systematic Review
    Abreu, Pedro Henriques
    Santos, Miriam Seoane
    Abreu, Miguel Henriques
    Andrade, Bruno
    Silva, Daniel Castro
    ACM COMPUTING SURVEYS, 2016, 49 (03)
  • [6] A case-based ensemble learning system for explainable breast cancer recurrence prediction
    Gu, Dongxiao
    Su, Kaixiang
    Zhao, Huimin
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2020, 107
  • [7] BIMSSA: enhancing cancer prediction with salp swarm optimization and ensemble machine learning approaches
    Panda, Pinakshi
    Bisoy, Sukant Kishoro
    Panigrahi, Amrutanshu
    Pati, Abhilash
    Sahu, Bibhuprasad
    Guo, Zheshan
    Liu, Haipeng
    Jain, Prince
    FRONTIERS IN GENETICS, 2025, 15
  • [8] Enhancing Monkeypox Detection: A Machine Learning Approach to Symptom Analysis and Disease Prediction
    Magsino, Dea Louisa B.
    Mercado, Russel Lenard O.
    Rivera, Francesca Nicole F.
    Magboo, Ma Sheila A.
    Magboo, Vincent Peter C.
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, PT I, AIAI 2024, 2024, 711 : 57 - 67
  • [9] Feature fusion based machine learning pipeline to improve breast cancer prediction
    Mishra, Arnab Kumar
    Roy, Pinki
    Bandyopadhyay, Sivaji
    Das, Sujit Kumar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (26) : 37627 - 37655
  • [10] Enhancing Concrete Workability Prediction Through Ensemble Learning Models: Emphasis on Slump and Material Factors
    Jiang, Jiangsong
    Xin, Chunhong
    Wu, Sifei
    Chen, Wenbing
    Li, Hui
    Ran, Zhaolun
    ADVANCES IN CIVIL ENGINEERING, 2024, 2024