Novel machine learning models outperform risk scores in predicting hepatocellular carcinoma in patients with chronic viral hepatitis

被引:44
作者
Wong, Grace Lai-Hung [1 ,2 ,3 ]
Hui, Vicki Wing-Ki [1 ,2 ]
Tan, Qingxiong [4 ]
Xu, Jingwen [4 ]
Lee, Hye Won [5 ]
Yip, Terry Cheuk-Fung [1 ,2 ,3 ]
Yang, Baoyao [4 ]
Tse, Yee-Kit [1 ,2 ]
Yin, Chong [4 ]
Lyu, Fei [4 ]
Lai, Jimmy Che-To [1 ,2 ,3 ]
Lui, Grace Chung-Yan [2 ]
Chan, Henry Lik-Yuen [1 ,6 ]
Yuen, Pong-Chi [4 ]
Wong, Vincent Wai-Sun [1 ,2 ,3 ]
机构
[1] Chinese Univ Hong Kong, Med Data Analyt Ctr, Hong Kong, Peoples R China
[2] Chinese Univ Hong Kong, Dept Med & Therapeut, Hong Kong, Peoples R China
[3] Chinese Univ Hong Kong, Inst Digest Dis, Hong Kong, Peoples R China
[4] Hong Kong Baptist Univ, Dept Comp Sci, Hong Kong, Peoples R China
[5] Yonsei Univ, Dept Internal Med, Coll Med, Seoul, South Korea
[6] Union Hosp, Hong Kong, Peoples R China
关键词
Antiviral treatment; Cirrhosis; Liver cancer; Mortality; World Health Organization; FATTY LIVER-DISEASE; FIBROSIS;
D O I
10.1016/j.jhepr.2022.100441
中图分类号
R57 [消化系及腹部疾病];
学科分类号
摘要
Background & Aims: Accurate hepatocellular carcinoma (HCC) risk prediction facilitates appropriate surveillance strategy and reduces cancer mortality. We aimed to derive and validate novel machine learning models to predict HCC in a territorywide cohort of patients with chronic viral hepatitis (CVH) using data from the Hospital Authority Data Collaboration Lab (HADCL). Methods: This was a territory-wide, retrospective, observational, cohort study of patients with CVH in Hong Kong in 2000-2018 identified from HADCL based on viral markers, diagnosis codes, and antiviral treatment for chronic hepatitis B and/or C. The cohort was randomly split into training and validation cohorts in a 7:3 ratio. Five popular machine learning methods, namely, logistic regression, ridge regression, AdaBoost, decision tree, and random forest, were performed and compared to find the best prediction model. Results: A total of 124,006 patients with CVH with complete data were included to build the models. In the training cohort (n = 86,804; 6,821 HCC), ridge regression (area under the receiver operating characteristic curve [AUROC] 0.842), decision tree (0.952), and random forest (0.992) performed the best. In the validation cohort (n = 37,202; 2,875 HCC), ridge regression (AUROC 0.844) and random forest (0.837) maintained their accuracy, which was significantly higher than those of HCC risk scores: CU-HCC (0.672), GAG-HCC (0.745), REACH-B (0.671), PAGE-B (0.748), and REAL-B (0.712) scores. The low cut-off (0.07) of HCC ridge score (HCC-RS) achieved 90.0% sensitivity and 98.6% negative predictive value (NPV) in the validation cohort. The high cut-off (0.15) of HCC-RS achieved high specificity (90.0%) and NPV (95.6%); 31.1% of patients remained indeterminate. Conclusions: HCC-RS from the ridge regression machine learning model accurately predicted HCC in patients with CVH. These machine learning models may be developed as built-in functional keys or calculators in electronic health systems to reduce cancer mortality. Lay summary: Novel machine learning models generated accurate risk scores for hepatocellular carcinoma (HCC) in patients with chronic viral hepatitis. HCC ridge score was consistently more accurate than existing HCC risk scores. These models may be incorporated into electronic medical health systems to develop appropriate cancer surveillance strategies and reduce cancer death. (C) 2022 The Author(s). Published by Elsevier B.V. on behalf of European Association for the Study of the Liver (EASL).
引用
收藏
页数:11
相关论文
共 30 条
[11]   Improvement in enhanced liver fibrosis score and liver stiffness measurement reflects lower risk of hepatocellular carcinoma [J].
Liang, Lilian Yan ;
Wong, Vincent Wai-Sun ;
Tse, Yee-Kit ;
Yip, Terry Cheuk-Fung ;
Lui, Grace Chung-Yan ;
Chan, Henry Lik-Yuen ;
Wong, Grace Lai-Hung .
ALIMENTARY PHARMACOLOGY & THERAPEUTICS, 2019, 49 (12) :1509-1517
[12]   A Machine Learning Approach Yields a Multiparameter Prognostic Marker in Liver Cancer [J].
Liu, Xiaoli ;
Lu, Jilin ;
Zhang, Guanxiong ;
Han, Junyan ;
Zhou, Wei ;
Chen, Huan ;
Zhang, Henghui ;
Yang, Zhiyun .
CANCER IMMUNOLOGY RESEARCH, 2021, 9 (03) :337-347
[13]   Global, regional, and national age-sex-specific mortality for 282 causes of death in 195 countries and territories, 1980-2017: a systematic analysis for the Global Burden of Disease Study 2017 [J].
Roth, Gregory A. ;
Abate, Degu ;
Abate, Kalkidan Hassen ;
Abay, Solomon M. ;
Abbafati, Cristiana ;
Abbasi, Nooshin ;
Abbastabar, Hedayat ;
Abd-Allah, Load ;
Abdela, Jemal ;
Abdelalim, Ahmed ;
Abdollahpour, Ibrahim ;
Abdulkader, Rizwan Suliankatchi ;
Abebe, Haftom Temesgen ;
Abebe, Molla ;
Abebe, Zegeye ;
Abejie, Ayenew Negesse ;
Abera, Semaw F. ;
Abil, Olifan Zewdie ;
Abraha, Haftom Niguse ;
Abrham, Aklilu Roba ;
Abu-Raddad, Laith Jamal ;
Accrombessi, Manfred Mario Kokou ;
Acharya, Dilaram ;
Adamu, Abdu A. ;
Adebayo, Oladimeji ;
Adedoyin, Rufus Adesoji ;
Adekanmbi, Victor ;
Adookunboh, Olatunii ;
Adhena, Beyene Meressa ;
Adib, Mina G. ;
Admasie, Aniha ;
Afshin, Ashkan ;
Agarwal, Gina ;
Agesa, Karelia M. ;
Agrawal, Anurag ;
Agrawal, Sutapa ;
Ahmadi, Alireza ;
Ahmadi, Melidi ;
Ahmed, Muktar Beshir ;
Ahmed, Sayent ;
Aichour, Amani Nidhal ;
Aichour, Ibtihel ;
Aichour, Miloud Taki Fddine ;
Akbari, Mohammad Esmaeil ;
Akinyeniti, Rufus Olusola ;
Akseer, Nadia ;
Al-Aly, Ziyad ;
Al-Eyadhy, Ayman ;
Al-Raddadi, Rajaa M. ;
Alandab, Fares .
LANCET, 2018, 392 (10159) :1736-1788
[14]   Liver diseases in the Asia-Pacific region: a Lancet Gastroenterology & Hepatology Commission [J].
Sarin, Shiv K. ;
Kumar, Manoj ;
Eslam, Mohammed ;
George, Jacob ;
Al Mahtab, Mamun ;
Akbar, Sheikh M. Fazie ;
Jia, Jidong ;
Tian, Qiuju ;
Aggarwal, Rakesh ;
Muljono, David H. ;
Omata, Masao ;
Ooka, Yoshihiko ;
Han, Kwang-Hyub ;
Lee, Hye W. ;
Jafri, Wasim ;
Butt, Amna S. ;
Chong, Chern H. ;
Lim, Seng G. ;
Pwu, Raoh-Fang ;
Chen, Ding-Shinn .
LANCET GASTROENTEROLOGY & HEPATOLOGY, 2020, 5 (02) :167-228
[15]   Explainable Uncertainty-Aware Convolutional Recurrent Neural Network for Irregular Medical Time Series [J].
Tan, Qingxiong ;
Ye, Mang ;
Ma, Andy Jinhua ;
Yang, Baoyao ;
Yip, Terry Cheuk-Fung ;
Wong, Grace Lai-Hung ;
Yuen, Pong C. .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (10) :4665-4679
[16]  
WHO, HEP C FACT SHEET
[17]  
WHO, 2017, World malaria report
[18]   Development of a non-invasive algorithm with transient elastography (Fibroscan) and serum test formula for advanced liver fibrosis in chronic hepatitis B [J].
Wong, G. L. H. ;
Wong, V. W. S. ;
Choi, P. C. L. ;
Chan, A. W. H. ;
Chan, H. L. Y. .
ALIMENTARY PHARMACOLOGY & THERAPEUTICS, 2010, 31 (10) :1095-1103
[19]   Artificial intelligence in prediction of non-alcoholic fatty liver disease and fibrosis [J].
Wong, Grace Lai-Hung ;
Yuen, Pong-Chi ;
Ma, Andy Jinhua ;
Chan, Anthony Wing-Hung ;
Leung, Howard Ho-Wai ;
Wong, Vincent Wai-Sun .
JOURNAL OF GASTROENTEROLOGY AND HEPATOLOGY, 2021, 36 (03) :543-550
[20]   An Aging Population of Chronic Hepatitis B With Increasing Comorbidities: A Territory-Wide Study From 2000 to 2017 [J].
Wong, Grace Lai-Hung ;
Wong, Vincent Wai-Sun ;
Yuen, Becky Wing-Yan ;
Tse, Yee-Kit ;
Luk, Hester Wing-Sum ;
Yip, Terry Cheuk-Fung ;
Hui, Vicki Wing-Ki ;
Liang, Lilian Yan ;
Lui, Grace Chung-Yan ;
Chan, Henry Lik-Yuen .
HEPATOLOGY, 2020, 71 (02) :444-455