Integrative Stacking Machine Learning Model for Small Cell Lung Cancer Prediction Using Metabolomics Profiling

Times cited: 0
Authors
Sumon, Md. Shaheenur Islam [1]
Malluhi, Marwan [2]
Anan, Noushin [1]
Abuhaweeleh, Mohannad Natheef [2]
Krzyslak, Hubert [3]
Vranic, Semir [2]
Chowdhury, Muhammad E. H. [1]
Pedersen, Shona [2]
Affiliations
[1] Qatar Univ, Dept Elect Engn, Doha 2713, Qatar
[2] Qatar Univ, Coll Med, QU Hlth, Doha 2713, Qatar
[3] Aalborg Univ Hosp, Dept Clin Biochem, DK-9000 Aalborg, Denmark
Keywords
SCLC; NSCLC; serum metabolomics; machine learning; stacking ensemble model; GASTRIN-RELEASING PEPTIDE; ENOLASE NSE; MARKERS; METABOLISM; BIOMARKERS; DIAGNOSIS; PROGRP; ROLES; TUMOR
DOI
10.3390/cancers16244225
Chinese Library Classification number
R73 [Oncology]
Discipline classification code
100214
Abstract
Background: Small cell lung cancer (SCLC) is an extremely aggressive form of lung cancer, characterized by rapid progression and poor survival rates. Despite the importance of early diagnosis, current diagnostic techniques are invasive and limited.

Methods: This study presents a stacking-based ensemble machine learning approach for classifying SCLC and non-small cell lung cancer (NSCLC) using serum metabolomics data. The analysis included 191 SCLC cases, 173 NSCLC cases, and 97 healthy controls. Feature selection techniques identified significant metabolites, with positive ions proving more relevant.

Results: For multi-class classification (control, SCLC, NSCLC), the stacking ensemble achieved 85.03% accuracy and an AUC of 92.47% using a Support Vector Machine (SVM). Binary classification (SCLC vs. NSCLC) performed better, with ExtraTreesClassifier reaching 88.19% accuracy and an AUC of 92.65%. SHapley Additive exPlanations (SHAP) analysis identified metabolites such as benzoic acid, DL-lactate, and L-arginine as significant predictors.

Conclusions: The stacking ensemble captures the complementary strengths of multiple classifiers, improving overall predictive performance in detecting SCLC and NSCLC. This work highlights the potential of combining metabolomics with advanced machine learning for non-invasive early detection of lung cancer subtypes, offering an alternative to conventional biopsy methods.
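The abstract names the ingredients of the approach (feature selection, several base classifiers, a stacking ensemble) but not their exact configuration. The following is a minimal sketch of a stacking pipeline in scikit-learn, not the authors' implementation. Everything specific is an assumption made for illustration: the choice of base learners, the use of an SVM as the meta-learner, the univariate F-test feature selector, the number of retained features, and the synthetic data standing in for metabolite intensities.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import (
    ExtraTreesClassifier,
    RandomForestClassifier,
    StackingClassifier,
)
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.metrics import accuracy_score, roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC


def build_stacking_pipeline(n_features_to_keep=20, random_state=0):
    """Scale features, keep the top-k by ANOVA F-score, then stack classifiers.

    Assumption: the base learners and the SVM meta-learner are illustrative
    choices, not the configuration reported in the paper.
    """
    base_learners = [
        ("extra_trees", ExtraTreesClassifier(n_estimators=100, random_state=random_state)),
        ("random_forest", RandomForestClassifier(n_estimators=100, random_state=random_state)),
    ]
    # The meta-learner is trained on out-of-fold predictions of the base learners.
    meta_learner = SVC(probability=True, random_state=random_state)
    stack = StackingClassifier(
        estimators=base_learners,
        final_estimator=meta_learner,
        cv=5,
    )
    return make_pipeline(
        StandardScaler(),
        SelectKBest(f_classif, k=n_features_to_keep),
        stack,
    )


def evaluate_on_synthetic_data(random_state=0):
    """Fit the pipeline on synthetic three-class data and return (accuracy, AUC)."""
    # Synthetic stand-in for a metabolite intensity matrix (hypothetical data).
    X, y = make_classification(
        n_samples=461,
        n_features=50,
        n_informative=10,
        n_classes=3,
        random_state=random_state,
    )
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.25, stratify=y, random_state=random_state
    )
    model = build_stacking_pipeline(random_state=random_state)
    model.fit(X_train, y_train)
    accuracy = accuracy_score(y_test, model.predict(X_test))
    # One-vs-rest macro-averaged AUC for the multi-class case.
    auc = roc_auc_score(y_test, model.predict_proba(X_test), multi_class="ovr")
    return accuracy, auc


if __name__ == "__main__":
    accuracy, auc = evaluate_on_synthetic_data()
    print(f"accuracy={accuracy:.3f}, macro OvR AUC={auc:.3f}")
```

Placing the scaler and feature selector inside the pipeline means they are fitted on the training split only, which avoids leaking test-set information into feature selection. The scores produced on synthetic data say nothing about the figures reported in the abstract.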
Pages: 21