Novel machine learning models outperform risk scores in predicting hepatocellular carcinoma in patients with chronic viral hepatitis

被引:39
作者
Wong, Grace Lai-Hung [1 ,2 ,3 ]
Hui, Vicki Wing-Ki [1 ,2 ]
Tan, Qingxiong [4 ]
Xu, Jingwen [4 ]
Lee, Hye Won [5 ]
Yip, Terry Cheuk-Fung [1 ,2 ,3 ]
Yang, Baoyao [4 ]
Tse, Yee-Kit [1 ,2 ]
Yin, Chong [4 ]
Lyu, Fei [4 ]
Lai, Jimmy Che-To [1 ,2 ,3 ]
Lui, Grace Chung-Yan [2 ]
Chan, Henry Lik-Yuen [1 ,6 ]
Yuen, Pong-Chi [4 ]
Wong, Vincent Wai-Sun [1 ,2 ,3 ]
机构
[1] Chinese Univ Hong Kong, Med Data Analyt Ctr, Hong Kong, Peoples R China
[2] Chinese Univ Hong Kong, Dept Med & Therapeut, Hong Kong, Peoples R China
[3] Chinese Univ Hong Kong, Inst Digest Dis, Hong Kong, Peoples R China
[4] Hong Kong Baptist Univ, Dept Comp Sci, Hong Kong, Peoples R China
[5] Yonsei Univ, Dept Internal Med, Coll Med, Seoul, South Korea
[6] Union Hosp, Hong Kong, Peoples R China
关键词
Antiviral treatment; Cirrhosis; Liver cancer; Mortality; World Health Organization; FATTY LIVER-DISEASE; FIBROSIS;
D O I
10.1016/j.jhepr.2022.100441
中图分类号
R57 [消化系及腹部疾病];
学科分类号
摘要
Background & Aims: Accurate hepatocellular carcinoma (HCC) risk prediction facilitates appropriate surveillance strategy and reduces cancer mortality. We aimed to derive and validate novel machine learning models to predict HCC in a territorywide cohort of patients with chronic viral hepatitis (CVH) using data from the Hospital Authority Data Collaboration Lab (HADCL). Methods: This was a territory-wide, retrospective, observational, cohort study of patients with CVH in Hong Kong in 2000-2018 identified from HADCL based on viral markers, diagnosis codes, and antiviral treatment for chronic hepatitis B and/or C. The cohort was randomly split into training and validation cohorts in a 7:3 ratio. Five popular machine learning methods, namely, logistic regression, ridge regression, AdaBoost, decision tree, and random forest, were performed and compared to find the best prediction model. Results: A total of 124,006 patients with CVH with complete data were included to build the models. In the training cohort (n = 86,804; 6,821 HCC), ridge regression (area under the receiver operating characteristic curve [AUROC] 0.842), decision tree (0.952), and random forest (0.992) performed the best. In the validation cohort (n = 37,202; 2,875 HCC), ridge regression (AUROC 0.844) and random forest (0.837) maintained their accuracy, which was significantly higher than those of HCC risk scores: CU-HCC (0.672), GAG-HCC (0.745), REACH-B (0.671), PAGE-B (0.748), and REAL-B (0.712) scores. The low cut-off (0.07) of HCC ridge score (HCC-RS) achieved 90.0% sensitivity and 98.6% negative predictive value (NPV) in the validation cohort. The high cut-off (0.15) of HCC-RS achieved high specificity (90.0%) and NPV (95.6%); 31.1% of patients remained indeterminate. Conclusions: HCC-RS from the ridge regression machine learning model accurately predicted HCC in patients with CVH. These machine learning models may be developed as built-in functional keys or calculators in electronic health systems to reduce cancer mortality. Lay summary: Novel machine learning models generated accurate risk scores for hepatocellular carcinoma (HCC) in patients with chronic viral hepatitis. HCC ridge score was consistently more accurate than existing HCC risk scores. These models may be incorporated into electronic medical health systems to develop appropriate cancer surveillance strategies and reduce cancer death. (C) 2022 The Author(s). Published by Elsevier B.V. on behalf of European Association for the Study of the Liver (EASL).
引用
收藏
页数:11
相关论文
共 30 条
  • [1] Genotype C hepatitis B virus infection is associated with an increased risk of hepatocellular carcinoma
    Chan, HLY
    Hui, AY
    Wong, ML
    Tse, AML
    Hung, LCT
    Wong, VWS
    Sung, JJY
    [J]. GUT, 2004, 53 (10) : 1494 - 1498
  • [2] A proportional hazards model for the subdistribution of a competing risk
    Fine, JP
    Gray, RJ
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1999, 94 (446) : 496 - 509
  • [3] Government HKSAR, 2020, HONG KONG VIR HEP AC
  • [4] Government HKSAR, BUILD CAR INCL SOC I
  • [5] Health Do, STEER COMM PREV CONT
  • [6] Hershberger Courtney E, 2021, Liver Cancer Int, V2, P33, DOI 10.1002/lci2.25
  • [7] Hospital Authority, DAT CAT
  • [8] Hospital Authority, HOSP AUTH DAT COLL
  • [9] Increasing antiviral treatment uptake improves survival in patients with HBV-related HCC
    Hui, Vicki Wing-Ki
    Chan, Stephen Lam
    Wong, Vincent Wai-Sun
    Liang, Lilian Yan
    Yip, Terry Cheuk-Fung
    Lai, Jimmy Che-To
    Yuen, Becky Wing-Yan
    Luk, Hester Wing-Sum
    Tse, Yee-Kit
    Lee, Hye-Won
    Chan, Henry Lik-Yuen
    Wong, Grace Lai-Hung
    [J]. JHEP REPORTS, 2020, 2 (06)
  • [10] Territory-wide population-based study of chronic hepatitis C infection and implications for hepatitis elimination in Hong Kong
    Hui, Yee Tak
    Wong, Grace L. H.
    Fung, James Y. Y.
    Chan, Henry L. Y.
    Leung, Nancy W. Y.
    Liu, Sienna D.
    Liu, Ken
    Ma, Yiu Keung
    But, David Y. K.
    Mak, Wing Yan
    Chan, Jacky M. C.
    Lai, Kin Bon
    Loo, Ching Kong
    Ng, Annie C. Y.
    Lai, Moon Sing
    Chan, Chun Wing
    Lau, Joulen Y. L.
    Fan, Tina T. T.
    Hui, Aric J.
    Lam, Belsy C. Y.
    Cheung, Wing I.
    Tsang, Owen T. Y.
    Lam, Karen
    Lai, Lawrence S. W.
    Luk, Wai Fan
    Li, Michael K. K.
    Lao, Wai Cheung
    Lam, Jodis T. W.
    Tsang, Steven W. C.
    Kung, Kam Ngai
    Chow, Wai Hung
    Tong, Ronald K. N.
    Lui, Thomas K. L.
    Shan, Edwin H. S.
    Yuen, Man Fung
    Wong, Vincent W. S.
    [J]. LIVER INTERNATIONAL, 2018, 38 (11) : 1911 - 1919