Machine Learning Algorithms Outperform Conventional Regression Models in Predicting Development of Hepatocellular Carcinoma

被引:213
作者
Singal, Amit G. [1 ,2 ,3 ]
Mukherjee, Ashin [4 ]
Elmunzer, B. Joseph [5 ]
Higgins, Peter D. R. [5 ]
Lok, Anna S. [5 ]
Zhu, Ji [1 ,4 ]
Marrero, Jorge A. [1 ]
Waljee, Akbar K. [5 ,6 ]
机构
[1] UT Southwestern Med Ctr, Dept Internal Med, Dallas, TX USA
[2] Univ Texas Southwestern, Dept Clin Sci, Dallas, TX USA
[3] UT Southwestern Med Ctr, Harold C Simmons Canc Ctr, Dallas, TX USA
[4] Univ Michigan, Dept Stat, Ann Arbor, MI 48109 USA
[5] Univ Michigan, Dept Internal Med, Ann Arbor, MI 48109 USA
[6] Vet Affairs Ctr Clin Management Res, Ann Arbor, MI USA
关键词
ALPHA-FETOPROTEIN; HEPATITIS-C; SURVEILLANCE; CURVE;
D O I
10.1038/ajg.2013.332
中图分类号
R57 [消化系及腹部疾病];
学科分类号
摘要
OBJECTIVES: Predictive models for hepatocellular carcinoma (HCC) have been limited by modest accuracy and lack of validation. Machine-learning algorithms offer a novel methodology, which may improve HCC risk prognostication among patients with cirrhosis. Our study's aim was to develop and compare predictive models for HCC development among cirrhotic patients, using conventional regression analysis and machine-learning algorithms. METHODS: We enrolled 442 patients with Child A or B cirrhosis at the University of Michigan between January 2004 and September 2006 (UM cohort) and prospectively followed them until HCC development, liver transplantation, death, or study termination. Regression analysis and machine-learning algorithms were used to construct predictive models for HCC development, which were tested on an independent validation cohort from the Hepatitis C Antiviral Long-term Treatment against Cirrhosis (HALT-C) Trial. Both models were also compared with the previously published HALT-C model. Discrimination was assessed using receiver operating characteristic curve analysis, and diagnostic accuracy was assessed with net reclassification improvement and integrated discrimination improvement statistics. RESULTS: After a median follow-up of 3.5 years, 41 patients developed HCC. The UM regression model had a c-statistic of 0.61 (95% confidence interval (CI) 0.56-0.67), whereas the machine-learning algorithm had a c-statistic of 0.64 (95% CI 0.60-0.69) in the validation cohort. The HALT-C model had a c-statistic of 0.60 (95% CI 0.50-0.70) in the validation cohort and was outperformed by the machine-learning algorithm. The machine-learning algorithm had significantly better diagnostic accuracy as assessed by net reclassification improvement (P<0.001) and integrated discrimination improvement (P=0.04). CONCLUSIONS: Machine-learning algorithms improve the accuracy of risk stratifying patients with cirrhosis and can be used to accurately identify patients at high-risk for developing HCC.
引用
收藏
页码:1723 / 1730
页数:8
相关论文
共 26 条
[1]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[2]   Management of hepatoceullular carcinoma [J].
Bruix, J ;
Sherman, M .
HEPATOLOGY, 2005, 42 (05) :1208-1236
[3]   Use and misuse of the receiver operating characteristic curve in risk prediction [J].
Cook, Nancy R. .
CIRCULATION, 2007, 115 (07) :928-935
[4]   Prolonged Therapy of Advanced Chronic Hepatitis C with Low-Dose Peginterferon [J].
Di Bisceglie, Adrian M. ;
Shiffman, Mitchell L. ;
Everson, Gregory T. ;
Lindsay, Karen L. ;
Everhart, James E. ;
Wright, Elizabeth C. ;
Lee, William M. ;
Lok, Anna S. ;
Bonkovsky, Herbert L. ;
Morgan, Timothy R. ;
Ghany, Marc G. ;
Morishima, Chihiro ;
Snow, Kristin K. ;
Dienstag, Jules L. .
NEW ENGLAND JOURNAL OF MEDICINE, 2008, 359 (23) :2429-2441
[5]   Hepatocellular carcinoma: Epidemiology and molecular carcinogenesis [J].
El-Serag, Hashem B. ;
Rudolph, Lenhard .
GASTROENTEROLOGY, 2007, 132 (07) :2557-2576
[6]   Hepatocellular carcinoma in cirrhosis: Incidence and risk factors [J].
Fattovich, G ;
Stroffolini, T ;
Zagni, I ;
Donato, F .
GASTROENTEROLOGY, 2004, 127 (05) :S35-S50
[7]   Disease-Free Survival after Hepatic Resection in Hepatocellular Carcinoma Patients: A Prediction Approach Using Artificial Neural Network [J].
Ho, Wen-Hsien ;
Lee, King-Teh ;
Chen, Hong-Yaw ;
Ho, Te-Wei ;
Chiu, Herng-Chia .
PLOS ONE, 2012, 7 (01)
[8]   Improving Screening for Hepatocellular Carcinoma by Incorporating Data on Levels of α-Fetoprotein, Over Time [J].
Lee, Elliot ;
Edward, Selwan ;
Singal, Amit G. ;
Lavieri, Mariel S. ;
Volk, Michael .
CLINICAL GASTROENTEROLOGY AND HEPATOLOGY, 2013, 11 (04) :437-440
[9]   Natural history of untreated nonsurgical hepatocellular carcinoma:: Rationale for the design and evaluation of therapeutic trials [J].
Llovet, JM ;
Bustamante, J ;
Castells, A ;
Vilana, R ;
Ayuso, MD ;
Sala, M ;
Brú, C ;
Rodés, J ;
Bruix, J .
HEPATOLOGY, 1999, 29 (01) :62-67
[10]   Incidence of Hepatocellular Carcinoma and Associated Risk Factors in Hepatitis C-Related Advanced Liver Disease [J].
Lok, Anna S. ;
Seeff, Leonard B. ;
Morgan, Timothy R. ;
Di Bisceglie, Adrian M. ;
Sterling, Richard K. ;
Curto, Teresa M. ;
Everson, Gregory T. ;
Lindsay, Karen L. ;
Lee, William M. ;
Bonkovsky, Herbert L. ;
Dienstag, Jules L. ;
Ghany, Marc G. ;
Morishima, Chihiro ;
Goodman, Zachary D. .
GASTROENTEROLOGY, 2009, 136 (01) :138-148