Optimizing Skin Cancer Survival Prediction with Ensemble Techniques

被引:2
作者
Abbasi, Erum Yousef [1 ]
Deng, Zhongliang [1 ]
Magsi, Arif Hussain [2 ]
Ali, Qasim [3 ]
Kumar, Kamlesh [4 ]
Zubedi, Asma [5 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Elect Engn, State Key Lab Wireless Network Positioning & Commu, Beijing 100876, Peoples R China
[2] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China
[3] Mehran Univ Engn & Technol, Dept Software Engn, Jamshoro 76062, Pakistan
[4] Beijing Univ Posts & Telecommun, Sch Elect Engn, Beijing 100876, Peoples R China
[5] Beijing Univ Posts & Telecommun, Sch Econ & Management, Beijing 100876, Peoples R China
来源
BIOENGINEERING-BASEL | 2024年 / 11卷 / 01期
关键词
skin cancer; melanoma; machine learning; ensemble technique; feature selection;
D O I
10.3390/bioengineering11010043
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
The advancement in cancer research using high throughput technology and artificial intelligence (AI) is gaining momentum to improve disease diagnosis and targeted therapy. However, the complex and imbalanced data with high dimensionality pose significant challenges for computational approaches and multi-omics data analysis. This study focuses on predicting skin cancer and analyzing overall survival probability. We employ the Kaplan-Meier estimator and Cox proportional hazards regression model, utilizing high-throughput machine learning (ML)-based ensemble methods. Our proposed ML-based ensemble techniques are applied to a publicly available dataset from the ICGC Data Portal, specifically targeting skin cutaneous melanoma cancers (SKCM). We used eight baseline classifiers, namely, random forest (RF), decision tree (DT), gradient boosting (GB), AdaBoost, Gaussian naive Bayes (GNB), extra tree (ET), logistic regression (LR), and light gradient boosting machine (Light GBM or LGBM). The study evaluated the performance of the proposed ensemble methods and survival analysis on SKCM. The proposed methods demonstrated promising results, outperforming other algorithms and models in terms of accuracy compared to traditional methods. Specifically, the RF classifier exhibited outstanding precision results. Additionally, four different ensemble methods (stacking, bagging, boosting, and voting) were created and trained to achieve optimal results. The performance was evaluated and interpreted using accuracy, precision, recall, F1 score, confusion matrix, and ROC curves, where the voting method achieved a promising accuracy of 99%. On the other hand, the RF classifier achieved an outstanding accuracy of 99%, which exhibits the best performance. We compared our proposed study with the existing state-of-the-art techniques and found significant improvements in several key aspects. Our approach not only demonstrated superior performance in terms of accuracy but also showcased remarkable efficiency. Thus, this research work contributes to diagnosing SKCM with high accuracy.
引用
收藏
页数:26
相关论文
共 38 条
[1]   Predicting Breast Cancer Leveraging Supervised Machine Learning Techniques [J].
Aamir, Sanam ;
Rahim, Aqsa ;
Aamir, Zain ;
Abbasi, Saadullah Farooq ;
Khan, Muhammad Shahbaz ;
Alhaisoni, Majed ;
Khan, Muhammad Attique ;
Khan, Khyber ;
Ahmad, Jawad .
COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2022, 2022
[2]   Methods to Analyze Time-to-Event Data: The Cox Regression Analysis [J].
Abd ElHafeez, Samar ;
D'Arrigo, Graziella ;
Leonardis, Daniela ;
Fusaro, Maria ;
Tripepi, Giovanni ;
Roumeliotis, Stefanos .
OXIDATIVE MEDICINE AND CELLULAR LONGEVITY, 2021, 2021
[3]  
Agrahari P., 2022, Futuristic Communication and Network Technologies, P179, DOI DOI 10.1007/978-981-16-4625-618
[4]   An Efficient Deep Learning-Based Skin Cancer Classifier for an Imbalanced Dataset [J].
Alam, Talha Mahboob ;
Shaukat, Kamran ;
Khan, Waseem Ahmad ;
Hameed, Ibrahim A. ;
Abd Almuqren, Latifah ;
Raza, Muhammad Ahsan ;
Aslam, Memoona ;
Luo, Suhuai .
DIAGNOSTICS, 2022, 12 (09)
[5]   An enhanced technique of skin cancer classification using deep convolutional neural network with transfer learning models [J].
Ali, Md Shahin ;
Miah, Md Sipon ;
Haque, Jahurul ;
Rahman, Md Mahbubur ;
Islam, Md Khairul .
MACHINE LEARNING WITH APPLICATIONS, 2021, 5
[6]   Melanoma Detection Using Deep Learning-Based Classifications [J].
Alwakid, Ghadah ;
Gouda, Walaa ;
Humayun, Mamoona ;
Sama, Najm Us .
HEALTHCARE, 2022, 10 (12)
[7]   Survival Analysis Part II: Multivariate data analysis - an introduction to concepts and methods [J].
Bradburn, MJ ;
Clark, TG ;
Love, SB ;
Altman, DG .
BRITISH JOURNAL OF CANCER, 2003, 89 (03) :431-436
[8]   Epidemiology and Risk Factors of Melanoma [J].
Carr, Stephanie ;
Smith, Christy ;
Wernberg, Jessica .
SURGICAL CLINICS OF NORTH AMERICA, 2020, 100 (01) :1-+
[9]   Variable selection strategies and its importance in clinical prediction modelling [J].
Chowdhury, Mohammad Ziaul Islam ;
Turin, Tanvir C. .
FAMILY MEDICINE AND COMMUNITY HEALTH, 2020, 8 (01)
[10]   Machine learning to predict overall short-term mortality in cutaneous melanoma [J].
Cozzolino, C. ;
Buja, A. ;
Rugge, M. ;
Miatton, A. ;
Zorzi, M. ;
Vecchiato, A. ;
Del Fiore, P. ;
Tropea, S. ;
Brazzale, A. ;
Damiani, G. ;
dall'Olmo, L. ;
Rossi, C. R. ;
Mocellin, S. .
DISCOVER ONCOLOGY, 2023, 14 (01)