Enhancing fairness in breast cancer recurrence prediction through temporal machine learning models

被引:0
|
作者
Sundus, Katrina I. [1 ]
Hammo, Bassam H. [1 ,2 ]
Al-Zoubi, Mohammad B. [1 ]
机构
[1] King Abdullah II School of Information Technology, The University of Jordan, Amman
[2] King Hussein School of Computing Sciences, Princess Sumaya University for Technology, Amman
关键词
Breast cancer recurrence; Ensemble learning; SMOTE; Temporal data; Under-sampling;
D O I
10.1007/s00521-024-10407-8
中图分类号
学科分类号
摘要
Breast cancer recurrence prediction is a significant challenge in oncology. Advanced methodologies are required to improve prediction accuracy and clinical decision-making. This study presents a novel approach to breast cancer recurrence prediction by integrating machine learning techniques and a hybrid data mining methodology incorporating a temporal dimension into dataset derivation. Our research is based on the Jordan Breast Cancer Dataset (JBRCA), which includes over 44,000 cases spanning 15 years collected from the King Hussein Cancer Center’s registry database in Amman, Jordan. The proposed methodology encompasses data understanding, preparation, and model development stages. We use a thorough data preparation process involving multicollinearity feature selection, feature scaling, and strategic sampling to address dataset challenges. Moreover, we introduce a temporal-derived dataset strategy, dividing the data into four distinct time intervals to capture evolving characteristics and optimize model relevance. We employ diverse base classifiers and ensemble methods to enhance predictive performance in model development. We use evaluation metrics such as accuracy, recall, specificity, G-mean, and ROC-AUC to assess model efficacy across temporal intervals. Our experimental findings reveal significant impacts on classifier performance with temporal dataset derivation, with notable strengths observed in specific classifiers and temporal intervals. For instance, the Naive Bayes model demonstrates efficacy in identifying recurrence cases, while logistic regression exhibits robust performance in ROC-AUC and G-mean metrics. Our study contributes to breast cancer recurrence prediction by introducing a novel methodology that addresses dataset challenges and leverages temporal insights for enhanced predictive accuracy. The findings have a direct impact on clinical practice, providing valuable tools for early detection and improved therapy planning. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.
引用
收藏
页码:22697 / 22718
页数:21
相关论文
共 50 条
  • [41] Prediction of Customer Churn Behavior in the Telecommunication Industry Using Machine Learning Models
    Chang, Victor
    Hall, Karl
    Xu, Qianwen Ariel
    Amao, Folakemi Ololade
    Ganatra, Meghana Ashok
    Benson, Vladlena
    ALGORITHMS, 2024, 17 (06)
  • [42] Enhancing machine learning-based sentiment analysis through feature extraction techniques
    Semary, Noura A.
    Ahmed, Wesam
    Amin, Khalid
    Plawiak, Pawel
    Hammad, Mohamed
    PLOS ONE, 2024, 19 (02):
  • [43] Machine learning models based on bubble analysis for Bitcoin market crash prediction
    Park, Sangjin
    Yang, Jae-Suk
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 135
  • [44] Enhancing crop yield prediction in Senegal using advanced machine learning techniques and synthetic data
    Razavi, Mohammad Amin
    Nejadhashemi, A. Pouyan
    Majidi, Babak
    Razavi, Hoda S.
    Kpodo, Josue
    Eeswaran, Rasu
    Ciampitti, Ignacio
    Prasad, P. V. Vara
    ARTIFICIAL INTELLIGENCE IN AGRICULTURE, 2024, 14 : 99 - 114
  • [45] Collaborative threat intelligence: Enhancing IoT security through blockchain and machine learning integration
    Nazir, Ahsan
    He, Jingsha
    Zhu, Nafei
    Wajahat, Ahsan
    Ullah, Faheem
    Qureshi, Sirajuddin
    Ma, Xiangjun
    Pathan, Muhammad Salman
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2024, 36 (02)
  • [46] Breast Cancer Classification Using AdaBoost-Extreme Learning Machine
    Sharifmoghadam, Mahboobe
    Jazayeriy, Hamid
    2019 5TH IRANIAN CONFERENCE ON SIGNAL PROCESSING AND INTELLIGENT SYSTEMS (ICSPIS 2019), 2019,
  • [47] The Role of Machine Learning in Enhancing Battery Management for Drone Operations: A Focus on SoH Prediction Using Ensemble Learning Techniques
    Cetinus, Buesra
    Oyucu, Saadin
    Aksoz, Ahmet
    Bicer, Emre
    BATTERIES-BASEL, 2024, 10 (10):
  • [48] Ensemble Machine Learning Algorithms for Precision Breast Cancer Diagnosis: A Multi-criteria Evaluation Approach
    Srinivasa Rao Pallapu
    Khasim Syed
    SN Computer Science, 6 (2)
  • [49] Spatial and temporal classification and prediction of aspen probability in boreal forests using machine learning algorithms
    Dmitriy Troshin
    Maksim Fayzulin
    Denis Mirin
    Environmental Monitoring and Assessment, 197 (5)
  • [50] VELM: a voting based ensemble learning model for breast cancer prediction
    Singh, Archana
    Kaswan, Kuldeep Singh
    Rajani
    PHYSICA SCRIPTA, 2025, 100 (02)