Comparison of Bayesian network and support vector machine models for two-year survival prediction in lung cancer patients treated with radiotherapy

被引:69
作者
Jayasurya, K. [1 ]
Fung, G. [1 ]
Yu, S. [1 ]
Dehing-Oberije, C. [2 ]
De Ruysscher, D. [2 ]
Hope, A. [3 ]
De Neve, W. [4 ]
Lievens, Y. [5 ]
Lambin, P. [2 ]
Dekker, A. L. A. J. [2 ]
机构
[1] Siemens Med Solut, Malvern, PA 19355 USA
[2] Maastricht Univ, Med Ctr, Dept Radiat Oncol, MAASTRO Clin, NL-6211 HK Maastricht, Netherlands
[3] Univ Hlth Network, Princess Margaret Hosp, Radiat Med Program, Toronto, ON M5G 2MG, Canada
[4] Ghent Univ Hosp, Dept Radiotherapy, B-9000 Ghent, Belgium
[5] Univ Hosp Leuven, Dept Radiotherapy, B-3000 Louvain, Belgium
关键词
belief networks; cancer; learning (artificial intelligence); lung; parameter estimation; positron emission tomography; radiation therapy; support vector machines; tumours; RADIATION-INDUCED PNEUMONITIS; PROGNOSTIC-FACTORS; MISSING VALUES; CLASSIFICATION; STAGE;
D O I
10.1118/1.3352709
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Purpose: Classic statistical and machine learning models such as support vector machines (SVMs) can be used to predict cancer outcome, but often only perform well if all the input variables are known, which is unlikely in the medical domain. Bayesian network (BN) models have a natural ability to reason under uncertainty and might handle missing data better. In this study, the authors hypothesize that a BN model can predict two-year survival in non-small cell lung cancer (NSCLC) patients as accurately as SVM, but will predict survival more accurately when data are missing. Methods: A BN and SVM model were trained on 322 inoperable NSCLC patients treated with radiotherapy from Maastricht and validated in three independent data sets of 35, 47, and 33 patients from Ghent, Leuven, and Toronto. Missing variables occurred in the data set with only 37, 28, and 24 patients having a complete data set. Results: The BN model structure and parameter learning identified gross tumor volume size, performance status, and number of positive lymph nodes on a PET as prognostic factors for two-year survival. When validated in the full validation set of Ghent, Leuven, and Toronto, the BN model had an AUC of 0.77, 0.72, and 0.70, respectively. A SVM model based on the same variables had an overall worse performance (AUC 0.71, 0.68, and 0.69) especially in the Ghent set, which had the highest percentage of missing the important GTV size data. When only patients with complete data sets were considered, the BN and SVM model performed more alike. Conclusions: Within the limitations of this study, the hypothesis is supported that BN models are better at handling missing data than SVM models and are therefore more suitable for the medical domain. Future works have to focus on improving the BN performance by including more patients, more variables, and more diversity.
引用
收藏
页码:1401 / 1407
页数:7
相关论文
共 20 条
[1]   Prognostic factors in stage III non-small-cell lung cancer [J].
Ademuyiwa, Foluso O. ;
Johnson, Cynthia S. ;
White, Angela S. ;
Breen, Timothy E. ;
Harvey, Jayme ;
Neubauer, Marcus ;
Hanna, Nasser H. .
CLINICAL LUNG CANCER, 2007, 8 (08) :478-482
[2]  
[Anonymous], 2001, COMPUT SCI STAT
[3]   Prognostic factors in non-small cell lung cancer - A decade of progress [J].
Brundage, MD ;
Davies, D ;
Mackillop, WJ .
CHEST, 2002, 122 (03) :1037-1057
[4]   Bayesian networks: Computer-assisted diagnosis support in radiology [J].
Burnside, ES .
ACADEMIC RADIOLOGY, 2005, 12 (04) :422-430
[5]   Investigation of the support vector machine algorithm to predict lung radiation-induced pneumonitis [J].
Chen, Shifeng ;
Zhou, Sumin ;
Yin, Fang-Fang ;
Marks, Lawrence B. ;
Das, Shiva K. .
MEDICAL PHYSICS, 2007, 34 (10) :3808-3814
[6]  
Chickering DM, 2004, J MACH LEARN RES, V5, P1287
[7]  
Christianini N., 2000, INTRO SUPPORT VECTOR, P189
[8]   Combining multiple models to generate consensus: Application to radiation-induced pneumonitis prediction [J].
Das, Shiva K. ;
Chen, Shifeng ;
Deasy, Joseph O. ;
Zhou, Sumin ;
Yin, Fang-Fang ;
Marks, Lawrence B. .
MEDICAL PHYSICS, 2008, 35 (11) :5098-5109
[9]   Tumor volume combined with number of positive lymph node stations is a more important prognostic factor than TNM stage for survival of non-small-cell lung cancer patients treated with (chemo)radiotherapy [J].
Dehing-Oberije, Cary ;
De Ruysscher, Dirk ;
van der Weide, Hiska ;
Hochstenbag, Monique ;
Bootsma, Gerben ;
Geraedts, Wiel ;
Pitz, Cordula ;
Simons, Jean ;
Teule, Jaap ;
Rahmy, Ali ;
Thimister, Paul ;
Steck, Harald ;
Lambin, Philippe .
INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2008, 70 (04) :1039-1044
[10]   DEVELOPMENT AND EXTERNAL VALIDATION OF PROGNOSTIC MODEL FOR 2-YEAR SURVIVAL OF NON-SMALL-CELL LUNG CANCER PATIENTS TREATED WITH CHEMORADIOTHERAPY [J].
Dehing-Oberije, Cary ;
Yu, Shipeng ;
De Ruysscher, Dirk ;
Meersschout, Sabine ;
Van Beek, Karen ;
Lievens, Yolande ;
Van Meerbeeck, Jan ;
De Neve, Wilfried ;
Rao, Bharat ;
van der Weide, Hiska ;
Lambin, Philippe .
INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2009, 74 (02) :355-362