Novel machine learning models outperform risk scores in predicting hepatocellular carcinoma in patients with chronic viral hepatitis

被引:36
|
作者
Wong, Grace Lai-Hung [1 ,2 ,3 ]
Hui, Vicki Wing-Ki [1 ,2 ]
Tan, Qingxiong [4 ]
Xu, Jingwen [4 ]
Lee, Hye Won [5 ]
Yip, Terry Cheuk-Fung [1 ,2 ,3 ]
Yang, Baoyao [4 ]
Tse, Yee-Kit [1 ,2 ]
Yin, Chong [4 ]
Lyu, Fei [4 ]
Lai, Jimmy Che-To [1 ,2 ,3 ]
Lui, Grace Chung-Yan [2 ]
Chan, Henry Lik-Yuen [1 ,6 ]
Yuen, Pong-Chi [4 ]
Wong, Vincent Wai-Sun [1 ,2 ,3 ]
机构
[1] Chinese Univ Hong Kong, Med Data Analyt Ctr, Hong Kong, Peoples R China
[2] Chinese Univ Hong Kong, Dept Med & Therapeut, Hong Kong, Peoples R China
[3] Chinese Univ Hong Kong, Inst Digest Dis, Hong Kong, Peoples R China
[4] Hong Kong Baptist Univ, Dept Comp Sci, Hong Kong, Peoples R China
[5] Yonsei Univ, Dept Internal Med, Coll Med, Seoul, South Korea
[6] Union Hosp, Hong Kong, Peoples R China
关键词
Antiviral treatment; Cirrhosis; Liver cancer; Mortality; World Health Organization; FATTY LIVER-DISEASE; FIBROSIS;
D O I
10.1016/j.jhepr.2022.100441
中图分类号
R57 [消化系及腹部疾病];
学科分类号
摘要
Background & Aims: Accurate hepatocellular carcinoma (HCC) risk prediction facilitates appropriate surveillance strategy and reduces cancer mortality. We aimed to derive and validate novel machine learning models to predict HCC in a territorywide cohort of patients with chronic viral hepatitis (CVH) using data from the Hospital Authority Data Collaboration Lab (HADCL). Methods: This was a territory-wide, retrospective, observational, cohort study of patients with CVH in Hong Kong in 2000-2018 identified from HADCL based on viral markers, diagnosis codes, and antiviral treatment for chronic hepatitis B and/or C. The cohort was randomly split into training and validation cohorts in a 7:3 ratio. Five popular machine learning methods, namely, logistic regression, ridge regression, AdaBoost, decision tree, and random forest, were performed and compared to find the best prediction model. Results: A total of 124,006 patients with CVH with complete data were included to build the models. In the training cohort (n = 86,804; 6,821 HCC), ridge regression (area under the receiver operating characteristic curve [AUROC] 0.842), decision tree (0.952), and random forest (0.992) performed the best. In the validation cohort (n = 37,202; 2,875 HCC), ridge regression (AUROC 0.844) and random forest (0.837) maintained their accuracy, which was significantly higher than those of HCC risk scores: CU-HCC (0.672), GAG-HCC (0.745), REACH-B (0.671), PAGE-B (0.748), and REAL-B (0.712) scores. The low cut-off (0.07) of HCC ridge score (HCC-RS) achieved 90.0% sensitivity and 98.6% negative predictive value (NPV) in the validation cohort. The high cut-off (0.15) of HCC-RS achieved high specificity (90.0%) and NPV (95.6%); 31.1% of patients remained indeterminate. Conclusions: HCC-RS from the ridge regression machine learning model accurately predicted HCC in patients with CVH. These machine learning models may be developed as built-in functional keys or calculators in electronic health systems to reduce cancer mortality. Lay summary: Novel machine learning models generated accurate risk scores for hepatocellular carcinoma (HCC) in patients with chronic viral hepatitis. HCC ridge score was consistently more accurate than existing HCC risk scores. These models may be incorporated into electronic medical health systems to develop appropriate cancer surveillance strategies and reduce cancer death. (C) 2022 The Author(s). Published by Elsevier B.V. on behalf of European Association for the Study of the Liver (EASL).
引用
收藏
页数:11
相关论文
共 50 条
  • [1] A machine learning model for predicting hepatocellular carcinoma risk in patients with chronic hepatitis B
    Lee, Hye Won
    Kim, Hwiyoung
    Park, Taeyun
    Park, Soo Young
    Chon, Young Eun
    Seo, Yeon Seok
    Lee, Jae Seung
    Park, Jun Yong
    Kim, Do Young
    Ahn, Sang Hoon
    Kim, Beom Kyung
    Kim, Seung Up
    LIVER INTERNATIONAL, 2023, 43 (08) : 1813 - 1821
  • [2] MACHINE LEARNING MODEL IS MORE ACCURATE THAN CONVENTIONAL RISK SCORES TO PREDICT HEPATOCELLULAR CARCINOMA IN PATIENTS WITH CHRONIC VIRAL HEPATITIS
    Wong, Grace L. H.
    Hui, Vicki Wing Ki
    Tan, Qingxiong
    Yip, Terry Cheuk-Fung
    Tse, Yee-Kit
    Yin, Chong
    Wong, Vincent
    Yuen, Pong-Chi
    HEPATOLOGY, 2021, 74 : 92A - 92A
  • [3] Assessing risk scores for predicting hepatocellular carcinoma in Thai patients with chronic hepatitis B
    Kamalapirat, Thanachote
    Yingcharoen, Kesinee
    Ungtrakul, Teerapat
    Soonklang, Kamonwan
    Dechma, Jiraporn
    Chunnuan, Pitchayachuda
    Kusuman, Pattama
    Pothijaroen, Charinthip
    Tawpa, Jantarika
    Cheirsilpa, Kunsuda
    Auewarakul, Chirayu
    JOURNAL OF VIRAL HEPATITIS, 2021, 28 (07) : 1034 - 1041
  • [4] Machine learning in predicting hepatocellular carcinoma in patients with chronic viral hepatitis in Hong Kong: abridged secondary publication
    Wong, G. L. H.
    Chan, H. L. Y.
    Tse, Y. K.
    Yuen, P. C.
    Wong, V. W. S.
    HONG KONG MEDICAL JOURNAL, 2023, 29 (01) : 14 - 17
  • [5] Machine Learning Algorithms Outperform Conventional Regression Models in Predicting Development of Hepatocellular Carcinoma
    Singal, Amit G.
    Mukherjee, Ashin
    Elmunzer, B. Joseph
    Higgins, Peter D. R.
    Lok, Anna S.
    Zhu, Ji
    Marrero, Jorge A.
    Waljee, Akbar K.
    AMERICAN JOURNAL OF GASTROENTEROLOGY, 2013, 108 (11): : 1723 - 1730
  • [6] RISK OF HEPATOCELLULAR CARCINOMA IN CHRONIC VIRAL HEPATITIS
    Kane, Lara J.
    Macdonald, Graeme A.
    CANCER FORUM, 2009, 33 (02)
  • [7] Reply to 'Consider more about risk scores for predicting hepatocellular carcinoma in patients with chronic hepatitis B'
    Kamalapirat, Thanachote
    Ungtrakul, Teerapat
    JOURNAL OF VIRAL HEPATITIS, 2021, 28 (10) : 1501 - 1501
  • [8] Machine Learning Algorithms Outperform Conventional Regression Models in Identifying Risk Factors for Hepatocellular Carcinoma in Patients With Cirrhosis
    Singal, Amit G.
    Waljee, Akbar K.
    Mukherjee, Ashin
    Higgins, Peter D.
    Zhu, Ji
    Marrero, Jorge A.
    GASTROENTEROLOGY, 2012, 142 (05) : S984 - S984
  • [9] Risk scores for hepatocellular carcinoma in chronic hepatitis B
    Sherman, Morris
    Kim, Ray W.
    Dziura, James
    HEPATOLOGY, 2015, 61 (06) : 1784 - 1786
  • [10] INTEGRATING QUANTITIATIVE SERUM HBSAG LEVELS INTO RISK SCORES FOR PREDICTING HEPATOCELLULAR CARCINOMA IN CHRONIC HEPATITIS B PATIENTS
    Chen, C. -J.
    Lee, M. -H.
    Yang, H. -I.
    Liu, J.
    Batrla-Utermann, R.
    Jen, C. -L.
    Iloeje, U. H.
    Su, J.
    Lu, S. -N.
    You, S. -L.
    Wang, L. -Y.
    JOURNAL OF HEPATOLOGY, 2012, 56 : S182 - S182