Lung Cancer Survival Prediction via Machine Learning Regression, Classification, and Statistical Techniques

被引:0
作者
Bartholomai, James A. [1 ]
Frieboes, Hermann B. [1 ]
机构
[1] Univ Louisville, Dept Bioengn, Louisville, KY 40292 USA
来源
2018 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT) | 2018年
关键词
lung cancer; machine learning; supervised classification; SEER database; biomedical big data; PATIENT SURVIVAL; EPIDEMIOLOGY; SURVEILLANCE; PHYSICIANS;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
A regression model is developed to predict survival time in months for lung cancer patients. It was previously shown that predictive models perform accurately for short survival times of less than 6 months; however, model accuracy is reduced when attempting to predict longer survival times. This study employs an approach for which regression models are used in combination with a classification model to predict survival time. A set of de-identified lung cancer patient data was obtained from the Surveillance, Epidemiology, and End Results (SEER) database. The models use a subset of factors selected by ANOVA. Model accuracy is measured by a confusion matrix for classification and by Root Mean Square Error (RMSE) for regression. Random Forests are used for classification, while general Linear Regression, Gradient Boosted Machines (GBM), and Random Forests are used for regression. The regression results show that RF had the best performance for survival times <= 6 and > 24 months (RMSE 10.52 and 20.51, respectively), while GBM performed best for 7-24 months (RMSE 15.65). Comparison plots of the results further indicate that the regression models perform better for shorter survival times than the RMSE values are able to reflect.
引用
收藏
页码:632 / 637
页数:6
相关论文
共 50 条
  • [31] Enhancing Lung Cancer Classification through Integration of Liquid Biopsy Multi-Omics Data with Machine Learning Techniques
    Kwon, Hyuk-Jung
    Park, Ui-Hyun
    Goh, Chul Jun
    Park, Dabin
    Lim, Yu Gyeong
    Lee, Isaac Kise
    Do, Woo-Jung
    Lee, Kyoung Joo
    Kim, Hyojung
    Yun, Seon-Young
    Joo, Joungsu
    Min, Na Young
    Lee, Sunghoon
    Um, Sang-Won
    Lee, Min-Seob
    CANCERS, 2023, 15 (18)
  • [32] Prediction of Postoperative Lung Function in Lung Cancer Patients Using Machine Learning Models
    Kwon, Oh Beom
    Han, Solji
    Lee, Hwa Young
    Kang, Hye Seon
    Kim, Sung Kyoung
    Kim, Ju Sang
    Park, Chan Kwon
    Lee, Sang Haak
    Kim, Seung Joon
    Kim, Jin Woo
    Yeo, Chang Dong
    TUBERCULOSIS AND RESPIRATORY DISEASES, 2023, 86 (03) : 203 - 215
  • [33] Predicting Postoperative Lung Cancer Recurrence and Survival Using Cox Proportional Hazards Regression and Machine Learning
    Pu, Lucy
    Dhupar, Rajeev
    Meng, Xin
    CANCERS, 2025, 17 (01)
  • [34] Prediction of lung cancer immunotherapy response via machine learning analysis of immune cell lineage and surface markers
    Mueller, Alex N.
    Morrisey, Samantha
    Miller, Hunter A.
    Hu, Xiaoling
    Kumar, Rohit
    Ngo, Phuong T.
    Yan, Jun
    Frieboes, Hermann B.
    CANCER BIOMARKERS, 2022, 34 (04) : 681 - 692
  • [35] Prediction of Recurrence in Non Small Cell Lung Cancer Patients with Gene Expression Data Using Machine Learning Techniques
    Bhattacharjee, Sudipto
    Saha, Banani
    Saha, Sudipto
    2023 INTERNATIONAL CONFERENCE ON COMPUTER, ELECTRICAL & COMMUNICATION ENGINEERING, ICCECE, 2023,
  • [36] Lung cancer survival prediction and biomarker identification with an ensemble machine learning analysis of tumor core biopsy metabolomic data
    Miller, Hunter A.
    van Berkel, Victor H.
    Frieboes, Hermann B.
    METABOLOMICS, 2022, 18 (08)
  • [37] Lung cancer survival prediction and biomarker identification with an ensemble machine learning analysis of tumor core biopsy metabolomic data
    Hunter A. Miller
    Victor H. van Berkel
    Hermann B. Frieboes
    Metabolomics, 18
  • [38] DEEP LEARNING SURVIVAL PREDICTION FOR LUNG CANCER PATIENTS
    Huang, Caiyun
    Yin, Changhua
    BIOMEDICAL ENGINEERING-APPLICATIONS BASIS COMMUNICATIONS, 2021, 33 (04):
  • [39] Machine Learning Techniques for Tomato Plant Diseases Clustering, Prediction and Classification
    Qasrawi, Radwan
    Amro, Malak
    Zaghal, Raid
    Sawafteh, Mohammad
    Polo, Stephanny Vicuna
    2021 INTERNATIONAL CONFERENCE ON PROMISING ELECTRONIC TECHNOLOGIES (ICPET 2021), 2021, : 40 - 45
  • [40] Customer purchasing behavior prediction using machine learning classification techniques
    Chaubey G.
    Gavhane P.R.
    Bisen D.
    Arjaria S.K.
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (12): : 16133 - 16157