Lung Cancer Survival Prediction via Machine Learning Regression, Classification, and Statistical Techniques

被引:0
|
作者
Bartholomai, James A. [1 ]
Frieboes, Hermann B. [1 ]
机构
[1] Univ Louisville, Dept Bioengn, Louisville, KY 40292 USA
来源
2018 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT) | 2018年
关键词
lung cancer; machine learning; supervised classification; SEER database; biomedical big data; PATIENT SURVIVAL; EPIDEMIOLOGY; SURVEILLANCE; PHYSICIANS;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
A regression model is developed to predict survival time in months for lung cancer patients. It was previously shown that predictive models perform accurately for short survival times of less than 6 months; however, model accuracy is reduced when attempting to predict longer survival times. This study employs an approach for which regression models are used in combination with a classification model to predict survival time. A set of de-identified lung cancer patient data was obtained from the Surveillance, Epidemiology, and End Results (SEER) database. The models use a subset of factors selected by ANOVA. Model accuracy is measured by a confusion matrix for classification and by Root Mean Square Error (RMSE) for regression. Random Forests are used for classification, while general Linear Regression, Gradient Boosted Machines (GBM), and Random Forests are used for regression. The regression results show that RF had the best performance for survival times <= 6 and > 24 months (RMSE 10.52 and 20.51, respectively), while GBM performed best for 7-24 months (RMSE 15.65). Comparison plots of the results further indicate that the regression models perform better for shorter survival times than the RMSE values are able to reflect.
引用
收藏
页码:632 / 637
页数:6
相关论文
共 50 条
  • [1] Prediction of lung cancer patient survival via supervised machine learning classification techniques
    Lynch, Chip M.
    Abdollahi, Behnaz
    Fuqua, Joshua D.
    de Carlo, Alexandra R.
    Bartholomai, James A.
    Balgemann, Rayeanne N.
    van Berkel, Victor H.
    Frieboes, Hermann B.
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2017, 108 : 1 - 8
  • [2] Machine Learning Techniques for Survival Time Prediction in Breast Cancer
    Mihaylov, Iliyan
    Nisheva, Maria
    Vassilev, Dimitar
    ARTIFICIAL INTELLIGENCE: METHODOLOGY, SYSTEMS, AND APPLICATIONS, AIMSA 2018, 2018, 11089 : 186 - 194
  • [3] A comparison of machine learning techniques for survival prediction in breast cancer
    Leonardo Vanneschi
    Antonella Farinaccio
    Giancarlo Mauri
    Marco Antoniotti
    Paolo Provero
    Mario Giacobini
    BioData Mining, 4
  • [4] A comparison of machine learning techniques for survival prediction in breast cancer
    Vanneschi, Leonardo
    Farinaccio, Antonella
    Mauri, Giancarlo
    Antoniotti, Mauro
    Provero, Paolo
    Giacobini, Mario
    BIODATA MINING, 2011, 4
  • [5] Comparing Statistical and Machine Learning Imputation Techniques in Breast Cancer Classification
    Chlioui, Imane
    Abnane, Ibtissam
    Idri, Ali
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS, ICCSA 2020, PART IV, 2020, 12252 : 61 - 76
  • [6] Classification Prediction of Lung Cancer Based on Machine Learning Method
    Li, Dantong
    Li, Guixin
    Li, Shuang
    Bang, Ashley
    INTERNATIONAL JOURNAL OF HEALTHCARE INFORMATION SYSTEMS AND INFORMATICS, 2024, 19 (01)
  • [7] Improved survival prediction for pancreatic cancer using machine learning and regression
    Floyd, Stuart H.
    Alvarez, Sergio A.
    Ruiz, Carolina
    Hayward, John
    Sullivan, Mary
    Tseng, Jennifer F.
    Whalen, Giles F.
    GASTROENTEROLOGY, 2007, 132 (04) : A869 - A870
  • [8] A systematic review on machine learning and deep learning techniques in cancer survival prediction
    Deepa, P.
    Gunavathi, C.
    PROGRESS IN BIOPHYSICS & MOLECULAR BIOLOGY, 2022, 174 : 62 - 71
  • [9] Prediction of Cervical Cancer Patients' Survival Period with Machine Learning Techniques
    Chanudom, Intorn
    Tharavichitkul, Ekkasit
    Laosiritaworn, Wimalin
    HEALTHCARE INFORMATICS RESEARCH, 2024, 30 (01) : 60 - 72
  • [10] Statistical Evaluation of Survival Rates in Lung Cancer Utilizing Gaussian and Logistic Regression Techniques
    Shankaraiah, Pitta
    Rayalu, G. Mokesh
    CONTEMPORARY MATHEMATICS, 2024, 5 (04): : 5213 - 5230