Predicting the 5-Year Risk of Nonalcoholic Fatty Liver Disease Using Machine Learning Models: Prospective Cohort Study

Cited by: 7
Authors
Huang, Guoqing [1,2]
Jin, Qiankai [1,2]
Mao, Yushan [1]
Affiliations
[1] Ningbo Univ, Affiliated Hosp 1, Dept Endocrinol, 247 Renmin Rd, Ningbo 315000, Peoples R China
[2] Ningbo Univ, Hlth Sci Ctr, Ningbo, Peoples R China
Keywords
nonalcoholic fatty liver disease; machine learning; independent risk factors; prediction model; model; fatty liver; prevention; liver; prognostic; China; development; validation; risk model; clinical applicability; NORMAL-WEIGHT; EPIDEMIOLOGY; PREVALENCE;
DOI
10.2196/46891
Chinese Library Classification number
R19 [Health care organization and services (health services management)];
Abstract
Background: Nonalcoholic fatty liver disease (NAFLD) has emerged as a worldwide public health issue. Identifying and targeting populations at heightened risk of developing NAFLD over a 5-year period can help reduce and delay adverse hepatic prognostic events.
Objective: This study aimed to investigate the 5-year incidence of NAFLD in the Chinese population and to establish and validate a machine learning model for predicting 5-year NAFLD risk.
Methods: The study population was derived from a 5-year prospective cohort study. A total of 6196 individuals without NAFLD who underwent health checkups in 2010 at Zhenhai Lianhua Hospital in Ningbo, China, were enrolled. Recursive feature elimination based on extreme gradient boosting (XGBoost), combined with the least absolute shrinkage and selection operator (LASSO), was used to screen for characteristic predictors. Six machine learning models, namely logistic regression, decision tree, support vector machine, random forest, categorical boosting (CatBoost), and XGBoost, were used to construct a 5-year risk model for NAFLD. Hyperparameters of the predictive models were optimized in the training set, and model performance was further evaluated in the internal and external validation sets.
Results: The 5-year incidence of NAFLD was 18.64% (n=1155) in the study population. We screened 11 predictors for risk prediction model construction. After hyperparameter optimization, CatBoost demonstrated the best prediction performance in the training set, with an area under the receiver operating characteristic curve (AUROC) of 0.810 (95% CI 0.768-0.852). Logistic regression showed the best prediction performance in the internal and external validation sets, with AUROCs of 0.778 (95% CI 0.759-0.794) and 0.806 (95% CI 0.788-0.821), respectively. The development of web-based calculators has enhanced the clinical feasibility of the risk prediction model.
Conclusions: Developing and validating machine learning models can aid in predicting which populations are at the highest risk of developing NAFLD over a 5-year period, thereby helping delay and reduce the occurrence of adverse liver prognostic events.
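The results above are reported as AUROC values with 95% CIs. As a rough illustration of how such a figure can be computed, the following is a minimal sketch in plain Python. It is not the authors' code: the abstract does not state how the intervals were obtained, so the percentile bootstrap used here is an assumption, and the function names (`auroc`, `bootstrap_auroc_ci`, `incidence_percent`) and the synthetic data are hypothetical.

```python
import random


def auroc(labels, scores):
    """AUROC as the Mann-Whitney probability that a positive case
    scores higher than a negative case (ties count as 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def bootstrap_auroc_ci(labels, scores, n_boot=200, alpha=0.05, seed=0):
    """Percentile bootstrap CI for the AUROC (assumed method, see lead-in)."""
    rng = random.Random(seed)
    n = len(labels)
    stats = []
    while len(stats) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]
        ys = [labels[i] for i in idx]
        if len(set(ys)) < 2:
            continue  # resample lacked one class; draw again
        stats.append(auroc(ys, [scores[i] for i in idx]))
    stats.sort()
    lower = stats[int((alpha / 2) * n_boot)]
    upper = stats[min(n_boot - 1, int((1 - alpha / 2) * n_boot))]
    return lower, upper


def incidence_percent(events, population):
    """Cumulative incidence as a percentage, rounded to 2 decimals."""
    return round(100.0 * events / population, 2)


def demo():
    # Synthetic scores only: positives are drawn from a shifted normal.
    rng = random.Random(42)
    labels = [1] * 40 + [0] * 160
    scores = [rng.gauss(1.0 if y == 1 else 0.0, 1.0) for y in labels]
    point = auroc(labels, scores)
    lower, upper = bootstrap_auroc_ci(labels, scores)
    print(f"AUROC {point:.3f} (95% CI {lower:.3f}-{upper:.3f})")
    # Counts taken from the abstract: 1155 incident cases among 6196 people.
    print(f"5-year incidence: {incidence_percent(1155, 6196)}%")


if __name__ == "__main__":
    demo()
```

The pairwise comparison is quadratic in sample size, which is acceptable for a small illustration; for a cohort of several thousand people, a rank-based implementation or a library routine would be the more practical choice.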
Pages: 19