Integrative Stacking Machine Learning Model for Small Cell Lung Cancer Prediction Using Metabolomics Profiling

Cited by: 0
Authors
Sumon, Md. Shaheenur Islam [1]
Malluhi, Marwan [2]
Anan, Noushin [1]
Abuhaweeleh, Mohannad Natheef [2]
Krzyslak, Hubert [3]
Vranic, Semir [2]
Chowdhury, Muhammad E. H. [1]
Pedersen, Shona [2]
Affiliations
[1] Qatar Univ, Dept Elect Engn, Doha 2713, Qatar
[2] Qatar Univ, Coll Med, QU Hlth, Doha 2713, Qatar
[3] Aalborg Univ Hosp, Dept Clin Biochem, DK-9000 Aalborg, Denmark
Keywords
SCLC; NSCLC; serum metabolomics; machine learning; stacking ensemble model; GASTRIN-RELEASING PEPTIDE; ENOLASE NSE; MARKERS; METABOLISM; BIOMARKERS; DIAGNOSIS; PROGRP; ROLES; TUMOR
DOI
10.3390/cancers16244225
Chinese Library Classification number
R73 [Oncology]
Discipline classification code
100214
Abstract
Background: Small cell lung cancer (SCLC) is an extremely aggressive form of lung cancer, characterized by rapid progression and poor survival rates. Despite the importance of early diagnosis, current diagnostic techniques are invasive and limited. Methods: This study presents a novel stacking-based ensemble machine learning approach for classifying SCLC and non-small cell lung cancer (NSCLC) using metabolomics data. The analysis included 191 SCLC cases, 173 NSCLC cases, and 97 healthy controls. Feature selection techniques identified significant metabolites, with positive ions proving more relevant. Results: For multi-class classification (control, SCLC, NSCLC), the stacking ensemble achieved 85.03% accuracy and an AUC of 92.47% with a Support Vector Machine (SVM). Binary classification (SCLC vs. NSCLC) further improved performance, with ExtraTreesClassifier reaching 88.19% accuracy and an AUC of 92.65%. SHapley Additive exPlanations (SHAP) analysis identified metabolites such as benzoic acid, DL-lactate, and L-arginine as significant predictors. Conclusions: The stacking ensemble leverages the complementary strengths of multiple classifiers to enhance overall predictive performance in detecting SCLC and NSCLC. This work highlights the potential of combining metabolomics with advanced machine learning for non-invasive early detection of lung cancer subtypes, offering an alternative to conventional biopsy methods.
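The stacking idea described in the abstract can be sketched roughly as follows. This is a minimal illustration with scikit-learn, not the authors' pipeline: the data are synthetic (generated to mirror only the cohort size and class proportions), the choice of base learners is an assumption, and placing the SVM as the meta-learner is also an assumption, since the abstract does not state which role each classifier played.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import (
    ExtraTreesClassifier,
    RandomForestClassifier,
    StackingClassifier,
)
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic stand-in for a metabolite intensity table (NOT the study's data).
# 461 samples mirror the cohort size: 97 controls + 191 SCLC + 173 NSCLC.
X, y = make_classification(
    n_samples=461,
    n_features=30,
    n_informative=10,
    n_classes=3,
    weights=[97 / 461, 191 / 461, 173 / 461],
    random_state=0,
)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)

# Hypothetical base learners; the abstract does not list the ones actually used.
base_learners = [
    ("logreg", make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))),
    ("rf", RandomForestClassifier(n_estimators=100, random_state=0)),
    ("extra_trees", ExtraTreesClassifier(n_estimators=100, random_state=0)),
]

# The meta-learner is trained on out-of-fold class probabilities produced by
# the base learners (5-fold cross-validation inside the training set).
stack = StackingClassifier(
    estimators=base_learners,
    final_estimator=make_pipeline(StandardScaler(), SVC()),
    cv=5,
    stack_method="predict_proba",
)
stack.fit(X_train, y_train)

y_pred = stack.predict(X_test)
accuracy = accuracy_score(y_test, y_pred)
print(f"Held-out accuracy on synthetic data: {accuracy:.3f}")
```

Because the data are synthetic, the printed accuracy says nothing about the reported 85.03% or 88.19%. The sketch only shows the mechanics: base learners produce out-of-fold predictions, and a second-level model learns how to combine them.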
Pages: 21