An enhanced Predictive heterogeneous ensemble model for breast cancer prediction

Cited by: 37
Authors
Nanglia, S. [1 ]
Ahmad, Muneer [1 ]
Khan, Fawad Ali [2 ]
Jhanjhi, N. Z. [3 ]
Affiliations
[1] Univ Malaya, Fac Comp Sci & Informat Technol, Dept Informat Syst, Kuala Lumpur 50603, Malaysia
[2] Univ Malaya, Fac Comp Sci & Informat Technol, Dept Comp Syst & Technol, Kuala Lumpur 50603, Malaysia
[3] SCE Taylors Univ, School Comp Sci & Engn, Subang Jaya, Malaysia
Keywords
Breast cancer; Machine learning; Data mining; Heterogeneous ensemble learning; Homogenous ensemble learning; Meta classifiers; NETWORK;
DOI
10.1016/j.bspc.2021.103279
Chinese Library Classification (CLC) number
R318 [Biomedical Engineering];
Discipline classification code
0831;
Abstract
Breast cancer is one of the most prevalent tumors after lung cancer and occurs in both women and men. The disease is mostly asymptomatic in its early stages, which makes detection difficult, and treatment in later stages is complicated and expensive, resulting in higher fatality rates. Compared with single-classifier approaches, comparatively few studies have applied ensemble learning to breast cancer prediction. This paper presents a heterogeneous ensemble machine learning approach for detecting breast cancer in its early stages. The proposed approach follows the CRISP-DM process and uses stacking to build the ensemble model from three different algorithms: K-Nearest Neighbors (KNN), Support Vector Machine (SVM), and Decision Tree (DT). The performance of this meta-classifier is compared with the individual performance of its base classifiers (KNN, SVM, DT), with other single classifiers, namely Logistic Regression (LR), Artificial Neural Network (ANN), Naive Bayes (NB), and Stochastic Gradient Descent (SGD), and with a homogeneous ensemble model, Random Forest (RF). The top five features (Glucose, Resistin, HOMA, Insulin, and BMI) are selected using the Chi-Square test. Evaluating the model helps assess whether it can be considered for early breast cancer prediction using only anthropometric data. Model performance is compared using accuracy, AUC, ROC curve, F1-score, precision, recall, log loss, and specificity under K-fold cross-validation with 2, 3, 5, 10, and 20 folds. The proposed ensemble model achieved the highest accuracy of 78% with the lowest log loss of 0.56 at K = 20, thus rejecting the null hypothesis. The p-value derived from the one-tailed t-test is 0.014, which is below the significance level α = 0.05.
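The pipeline described in the abstract (Chi-Square selection of five features, a stacked ensemble of KNN, SVM, and DT, and K-fold evaluation by accuracy and log loss) can be sketched roughly as follows. This is a minimal illustration with scikit-learn, not the authors' implementation: the synthetic data, the logistic-regression meta-learner, the min-max scaling step, and all hyperparameters are assumptions, since the abstract does not specify them.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import StackingClassifier
from sklearn.feature_selection import SelectKBest, chi2
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold, cross_validate
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import MinMaxScaler
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

# Assumption: synthetic stand-in data; the study's dataset is not used here.
X, y = make_classification(
    n_samples=120, n_features=9, n_informative=5, random_state=0
)

# Heterogeneous base learners named in the abstract.
base_learners = [
    ("knn", KNeighborsClassifier()),
    ("svm", SVC(random_state=0)),
    ("dt", DecisionTreeClassifier(random_state=0)),
]

# Assumption: logistic regression as meta-learner (not stated in the abstract).
stack = StackingClassifier(
    estimators=base_learners,
    final_estimator=LogisticRegression(),
    cv=5,
)

# chi2 requires non-negative inputs, so features are min-max scaled first;
# SelectKBest then keeps the five highest-scoring features.
model = make_pipeline(MinMaxScaler(), SelectKBest(chi2, k=5), stack)


def evaluate(n_splits):
    """Return (mean accuracy, mean log loss) under stratified K-fold CV."""
    cv = StratifiedKFold(n_splits=n_splits, shuffle=True, random_state=0)
    scores = cross_validate(
        model, X, y, cv=cv, scoring=("accuracy", "neg_log_loss")
    )
    return scores["test_accuracy"].mean(), -scores["test_neg_log_loss"].mean()


# Fold counts taken from the abstract.
results = {k: evaluate(k) for k in (2, 3, 5, 10, 20)}
for k, (acc, loss) in results.items():
    print(f"K={k:2d}  accuracy={acc:.3f}  log loss={loss:.3f}")
```

Placing the scaler and feature selector inside the pipeline means both are refit on each training fold, so no information from the held-out fold leaks into feature selection. Numbers printed by this sketch come from synthetic data and are not comparable to the 78% accuracy or 0.56 log loss reported in the abstract.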
Pages: 11