Establishment of a machine learning predictive model for non-alcoholic fatty liver disease: A longitudinal cohort study

Cited by: 2
Authors
Cao, Tengrui [1 ,2 ]
Zhu, Qian [1 ,2 ,3 ]
Tong, Chao [4 ]
Halengbieke, Aheyeerke [1 ,2 ]
Ni, Xuetong [1 ,2 ]
Tang, Jianmin [1 ,2 ]
Han, Yumei [5 ]
Li, Qiang [5 ]
Yang, Xinghua [1 ,2 ]
Affiliations
[1] Capital Med Univ, Sch Publ Hlth, 10 Xitoutiao, Beijing 100069, Peoples R China
[2] Beijing Municipal Key Lab Clin Epidemiol, 10 Xitoutiao, Beijing 100069, Peoples R China
[3] Chinese Acad Med Sci & Peking Union Med Coll, Natl Canc Ctr, Natl Clin Res Ctr Canc, Canc Hosp,Off Canc Registry, Beijing 100021, Peoples R China
[4] Beijing Ctr Dis Prevent & Control, Beijing 100013, Peoples R China
[5] Beijing Phys Examinat Ctr, Sci & Educ Sect, 59 Beiwei Rd, Beijing 100050, Peoples R China
Funding
Beijing Natural Science Foundation; National Key Research and Development Program of China;
Keywords
Non-alcoholic fatty liver disease; Predictive model; eXtreme gradient boosting; Machine learning; DIAGNOSIS; INDEX; NAFLD; TESTS;
DOI
10.1016/j.numecd.2024.02.004
Chinese Library Classification number
R5 [Internal Medicine];
Discipline classification code
1002 ; 100201 ;
Abstract
Background and aims: Non-alcoholic fatty liver disease (NAFLD) is a common chronic liver disease that lacks effective drug treatments. This study aimed to construct an eXtreme Gradient Boosting (XGBoost) prediction model to identify or evaluate potential NAFLD patients. Methods and results: We conducted a longitudinal study of 22,140 individuals from the Beijing Health Management Cohort. Variable filtering was performed using the least absolute shrinkage and selection operator. Random Over Sampling Examples was used to address imbalanced data. Next, the XGBoost model and three other machine learning (ML) models were built using the balanced data. Finally, the variable importance of the XGBoost model was ranked. Of the four ML algorithms, the XGBoost model performed best, with an accuracy of 0.835, sensitivity of 0.835, specificity of 0.834, Youden index of 0.669, precision of 0.831, recall of 0.835, F-1 score of 0.833, and an area under the curve of 0.914. The top five variables with the greatest impact on the onset of NAFLD were aspartate aminotransferase, cardiometabolic index, body mass index, alanine aminotransferase, and triglyceride-glucose index. Conclusion: The predictive model based on the XGBoost algorithm enables early prediction of the onset of NAFLD. Additionally, assessing variable importance provides valuable insights into the prevention and treatment of NAFLD. (c) 2024 The Italian Diabetes Society, the Italian Society for the Study of Atherosclerosis, the Italian Society of Human Nutrition and the Department of Clinical Medicine and Surgery, Federico II University. Published by Elsevier B.V. All rights reserved.
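Two of the metrics reported in the abstract are derived from the others, so they can be checked by hand. The sketch below is not the authors' code; it only applies the standard definitions (Youden index = sensitivity + specificity - 1, F-1 score = harmonic mean of precision and recall) to the rounded values quoted above. The function names and the `reported` dictionary are illustrative assumptions.

```python
def youden_index(sensitivity: float, specificity: float) -> float:
    """Youden's J statistic: sensitivity + specificity - 1."""
    return sensitivity + specificity - 1.0


def f1_score(precision: float, recall: float) -> float:
    """F-1 score: harmonic mean of precision and recall."""
    return 2.0 * precision * recall / (precision + recall)


# Rounded values quoted in the abstract for the XGBoost model.
# (The dictionary itself is a hypothetical container for illustration.)
reported = {
    "sensitivity": 0.835,
    "specificity": 0.834,
    "precision": 0.831,
    "recall": 0.835,
}

j = youden_index(reported["sensitivity"], reported["specificity"])
f1 = f1_score(reported["precision"], reported["recall"])

print(f"Youden index: {j:.3f}")  # prints "Youden index: 0.669"
print(f"F-1 score: {f1:.3f}")    # prints "F-1 score: 0.833"
```

Both recomputed values agree with the figures stated in the abstract (0.669 and 0.833) to three decimal places.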
Pages: 1456 - 1466
Number of pages: 11
Related papers
50 in total
  • [31] Precision Medicine in Fatty Liver Disease/Non-Alcoholic Fatty Liver Disease
    Valenzuela-Vallejo, Laura
    Sanoudou, Despina
    Mantzoros, Christos S.
    JOURNAL OF PERSONALIZED MEDICINE, 2023, 13 (05)
  • [32] Longitudinal study of circulating miR-122 in a rat model of non-alcoholic fatty liver disease
    Yamada, Hiroya
    Ohashi, Koji
    Suzuki, Koji
    Munetsuna, Eiji
    Ando, Yoshitaka
    Yamazaki, Mirai
    Ishikawa, Hiroaki
    Ichino, Naohiro
    Teradaira, Ryouji
    Hashimoto, Shuji
    CLINICA CHIMICA ACTA, 2015, 446 : 267 - 271
  • [33] Early Identification of Risk Factors in Non-Alcoholic Fatty Liver Disease (NAFLD) Using Machine Learning
    Guarneros-Nolasco, Luis Rolando
    Alor-Hernandez, Giner
    Prieto-Avalos, Guillermo
    Sanchez-Cervantes, Jose Luis
    MATHEMATICS, 2023, 11 (13)
  • [34] Machine Learning-Based Identification of Potentially Novel Non-Alcoholic Fatty Liver Disease Biomarkers
    Shafiha, Roshan
    Bahcivanci, Basak
    Gkoutos, Georgios V.
    Acharjee, Animesh
    BIOMEDICINES, 2021, 9 (11)
  • [35] Non-Alcoholic Fatty Liver Disease, Atherosclerosis, and Cardiovascular Disease in Asia
    Lim, Yohwan
    Jeong, Seogsong
    Hong, Myunghee
    Han, Hyun Wook
    REVIEWS IN CARDIOVASCULAR MEDICINE, 2023, 24 (06)
  • [36] Non-alcoholic Fatty Liver and Liver Fibrosis Predictive Analytics: Risk Prediction and Machine Learning Techniques for Improved Preventive Medicine
    Orit Goldman
    Ofir Ben-Assuli
    Ori Rogowski
    David Zeltser
    Itzhak Shapira
    Shlomo Berliner
    Shira Zelber-Sagi
    Shani Shenhar-Tsarfaty
    Journal of Medical Systems, 2021, 45
  • [37] Non-alcoholic fatty liver disease: a new epidemic in children
    Ciocca, Mirta
    Ramonet, Margarita
    Alvarez, Fernando
    ARCHIVOS ARGENTINOS DE PEDIATRIA, 2016, 114 (06) : 563 - 568
  • [38] Irisin Is Related to Non-Alcoholic Fatty Liver Disease (NAFLD)
    Kosmalski, Marcin
    Drzewoski, Jozef
    Szymczak-Pajor, Izabela
    Zieleniak, Andrzej
    Mikolajczyk-Solinska, Melania
    Kasznicki, Jacek
    Sliwinska, Agnieszka
    BIOMEDICINES, 2022, 10 (09)
  • [39] Non-alcoholic fatty liver disease
    Calderaro, Julien
    Zafrani, Elie Serge
    ANNALES DE PATHOLOGIE, 2010, 30 (06) : 413 - 420
  • [40] Non-alcoholic fatty liver disease
    Li, Wenhao
    Alazawi, William
    CLINICAL MEDICINE, 2020, 20 (05) : 509 - 512