Novel machine learning models outperform risk scores in predicting hepatocellular carcinoma in patients with chronic viral hepatitis

被引:39
作者
Wong, Grace Lai-Hung [1 ,2 ,3 ]
Hui, Vicki Wing-Ki [1 ,2 ]
Tan, Qingxiong [4 ]
Xu, Jingwen [4 ]
Lee, Hye Won [5 ]
Yip, Terry Cheuk-Fung [1 ,2 ,3 ]
Yang, Baoyao [4 ]
Tse, Yee-Kit [1 ,2 ]
Yin, Chong [4 ]
Lyu, Fei [4 ]
Lai, Jimmy Che-To [1 ,2 ,3 ]
Lui, Grace Chung-Yan [2 ]
Chan, Henry Lik-Yuen [1 ,6 ]
Yuen, Pong-Chi [4 ]
Wong, Vincent Wai-Sun [1 ,2 ,3 ]
机构
[1] Chinese Univ Hong Kong, Med Data Analyt Ctr, Hong Kong, Peoples R China
[2] Chinese Univ Hong Kong, Dept Med & Therapeut, Hong Kong, Peoples R China
[3] Chinese Univ Hong Kong, Inst Digest Dis, Hong Kong, Peoples R China
[4] Hong Kong Baptist Univ, Dept Comp Sci, Hong Kong, Peoples R China
[5] Yonsei Univ, Dept Internal Med, Coll Med, Seoul, South Korea
[6] Union Hosp, Hong Kong, Peoples R China
关键词
Antiviral treatment; Cirrhosis; Liver cancer; Mortality; World Health Organization; FATTY LIVER-DISEASE; FIBROSIS;
D O I
10.1016/j.jhepr.2022.100441
中图分类号
R57 [消化系及腹部疾病];
学科分类号
摘要
Background & Aims: Accurate hepatocellular carcinoma (HCC) risk prediction facilitates appropriate surveillance strategy and reduces cancer mortality. We aimed to derive and validate novel machine learning models to predict HCC in a territorywide cohort of patients with chronic viral hepatitis (CVH) using data from the Hospital Authority Data Collaboration Lab (HADCL). Methods: This was a territory-wide, retrospective, observational, cohort study of patients with CVH in Hong Kong in 2000-2018 identified from HADCL based on viral markers, diagnosis codes, and antiviral treatment for chronic hepatitis B and/or C. The cohort was randomly split into training and validation cohorts in a 7:3 ratio. Five popular machine learning methods, namely, logistic regression, ridge regression, AdaBoost, decision tree, and random forest, were performed and compared to find the best prediction model. Results: A total of 124,006 patients with CVH with complete data were included to build the models. In the training cohort (n = 86,804; 6,821 HCC), ridge regression (area under the receiver operating characteristic curve [AUROC] 0.842), decision tree (0.952), and random forest (0.992) performed the best. In the validation cohort (n = 37,202; 2,875 HCC), ridge regression (AUROC 0.844) and random forest (0.837) maintained their accuracy, which was significantly higher than those of HCC risk scores: CU-HCC (0.672), GAG-HCC (0.745), REACH-B (0.671), PAGE-B (0.748), and REAL-B (0.712) scores. The low cut-off (0.07) of HCC ridge score (HCC-RS) achieved 90.0% sensitivity and 98.6% negative predictive value (NPV) in the validation cohort. The high cut-off (0.15) of HCC-RS achieved high specificity (90.0%) and NPV (95.6%); 31.1% of patients remained indeterminate. Conclusions: HCC-RS from the ridge regression machine learning model accurately predicted HCC in patients with CVH. These machine learning models may be developed as built-in functional keys or calculators in electronic health systems to reduce cancer mortality. Lay summary: Novel machine learning models generated accurate risk scores for hepatocellular carcinoma (HCC) in patients with chronic viral hepatitis. HCC ridge score was consistently more accurate than existing HCC risk scores. These models may be incorporated into electronic medical health systems to develop appropriate cancer surveillance strategies and reduce cancer death. (C) 2022 The Author(s). Published by Elsevier B.V. on behalf of European Association for the Study of the Liver (EASL).
引用
收藏
页数:11
相关论文
共 30 条
  • [11] Improvement in enhanced liver fibrosis score and liver stiffness measurement reflects lower risk of hepatocellular carcinoma
    Liang, Lilian Yan
    Wong, Vincent Wai-Sun
    Tse, Yee-Kit
    Yip, Terry Cheuk-Fung
    Lui, Grace Chung-Yan
    Chan, Henry Lik-Yuen
    Wong, Grace Lai-Hung
    [J]. ALIMENTARY PHARMACOLOGY & THERAPEUTICS, 2019, 49 (12) : 1509 - 1517
  • [12] A Machine Learning Approach Yields a Multiparameter Prognostic Marker in Liver Cancer
    Liu, Xiaoli
    Lu, Jilin
    Zhang, Guanxiong
    Han, Junyan
    Zhou, Wei
    Chen, Huan
    Zhang, Henghui
    Yang, Zhiyun
    [J]. CANCER IMMUNOLOGY RESEARCH, 2021, 9 (03) : 337 - 347
  • [13] Global, regional, and national age-sex-specific mortality for 282 causes of death in 195 countries and territories, 1980-2017: a systematic analysis for the Global Burden of Disease Study 2017
    Roth, Gregory A.
    Abate, Degu
    Abate, Kalkidan Hassen
    Abay, Solomon M.
    Abbafati, Cristiana
    Abbasi, Nooshin
    Abbastabar, Hedayat
    Abd-Allah, Load
    Abdela, Jemal
    Abdelalim, Ahmed
    Abdollahpour, Ibrahim
    Abdulkader, Rizwan Suliankatchi
    Abebe, Haftom Temesgen
    Abebe, Molla
    Abebe, Zegeye
    Abejie, Ayenew Negesse
    Abera, Semaw F.
    Abil, Olifan Zewdie
    Abraha, Haftom Niguse
    Abrham, Aklilu Roba
    Abu-Raddad, Laith Jamal
    Accrombessi, Manfred Mario Kokou
    Acharya, Dilaram
    Adamu, Abdu A.
    Adebayo, Oladimeji
    Adedoyin, Rufus Adesoji
    Adekanmbi, Victor
    Adookunboh, Olatunii
    Adhena, Beyene Meressa
    Adib, Mina G.
    Admasie, Aniha
    Afshin, Ashkan
    Agarwal, Gina
    Agesa, Karelia M.
    Agrawal, Anurag
    Agrawal, Sutapa
    Ahmadi, Alireza
    Ahmadi, Melidi
    Ahmed, Muktar Beshir
    Ahmed, Sayent
    Aichour, Amani Nidhal
    Aichour, Ibtihel
    Aichour, Miloud Taki Fddine
    Akbari, Mohammad Esmaeil
    Akinyeniti, Rufus Olusola
    Akseer, Nadia
    Al-Aly, Ziyad
    Al-Eyadhy, Ayman
    Al-Raddadi, Rajaa M.
    Alandab, Fares
    [J]. LANCET, 2018, 392 (10159) : 1736 - 1788
  • [14] Liver diseases in the Asia-Pacific region: a Lancet Gastroenterology & Hepatology Commission
    Sarin, Shiv K.
    Kumar, Manoj
    Eslam, Mohammed
    George, Jacob
    Al Mahtab, Mamun
    Akbar, Sheikh M. Fazie
    Jia, Jidong
    Tian, Qiuju
    Aggarwal, Rakesh
    Muljono, David H.
    Omata, Masao
    Ooka, Yoshihiko
    Han, Kwang-Hyub
    Lee, Hye W.
    Jafri, Wasim
    Butt, Amna S.
    Chong, Chern H.
    Lim, Seng G.
    Pwu, Raoh-Fang
    Chen, Ding-Shinn
    [J]. LANCET GASTROENTEROLOGY & HEPATOLOGY, 2020, 5 (02): : 167 - 228
  • [15] Explainable Uncertainty-Aware Convolutional Recurrent Neural Network for Irregular Medical Time Series
    Tan, Qingxiong
    Ye, Mang
    Ma, Andy Jinhua
    Yang, Baoyao
    Yip, Terry Cheuk-Fung
    Wong, Grace Lai-Hung
    Yuen, Pong C.
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (10) : 4665 - 4679
  • [16] WHO, HEP C FACT SHEET
  • [17] WHO, 2017, GLOB HEP REP
  • [18] Development of a non-invasive algorithm with transient elastography (Fibroscan) and serum test formula for advanced liver fibrosis in chronic hepatitis B
    Wong, G. L. H.
    Wong, V. W. S.
    Choi, P. C. L.
    Chan, A. W. H.
    Chan, H. L. Y.
    [J]. ALIMENTARY PHARMACOLOGY & THERAPEUTICS, 2010, 31 (10) : 1095 - 1103
  • [19] Artificial intelligence in prediction of non-alcoholic fatty liver disease and fibrosis
    Wong, Grace Lai-Hung
    Yuen, Pong-Chi
    Ma, Andy Jinhua
    Chan, Anthony Wing-Hung
    Leung, Howard Ho-Wai
    Wong, Vincent Wai-Sun
    [J]. JOURNAL OF GASTROENTEROLOGY AND HEPATOLOGY, 2021, 36 (03) : 543 - 550
  • [20] An Aging Population of Chronic Hepatitis B With Increasing Comorbidities: A Territory-Wide Study From 2000 to 2017
    Wong, Grace Lai-Hung
    Wong, Vincent Wai-Sun
    Yuen, Becky Wing-Yan
    Tse, Yee-Kit
    Luk, Hester Wing-Sum
    Yip, Terry Cheuk-Fung
    Hui, Vicki Wing-Ki
    Liang, Lilian Yan
    Lui, Grace Chung-Yan
    Chan, Henry Lik-Yuen
    [J]. HEPATOLOGY, 2020, 71 (02) : 444 - 455