Machine learning models for predicting metabolic dysfunction-associated steatotic liver disease prevalence using basic demographic and clinical characteristics

被引:0
作者
Zhu, Gangfeng [1 ]
Song, Yipeng [1 ]
Lu, Zenghong [2 ]
Yi, Qiang [1 ]
Xu, Rui [3 ]
Xie, Yi [1 ]
Geng, Shi [4 ]
Yang, Na [4 ,5 ]
Zheng, Liangjian [1 ]
Feng, Xiaofei [2 ]
Zhu, Rui [2 ]
Wang, Xiangcai [2 ]
Huang, Li [2 ]
Xiang, Yi [2 ,6 ]
机构
[1] Gannan Med Univ, Clin Med Coll 1, Ganzhou 341000, Jiangxi Provinc, Peoples R China
[2] Gannan Med Univ, Affiliated Hosp 1, Jiangxi Clin Res Ctr Canc, Dept Oncol, Ganzhou 341000, Jiangxi Provinc, Peoples R China
[3] Zhejiang Univ, Affiliated Jinhua Hosp, Dept Rehabil Med, Sch Med, Jinhua 321000, Zhejiang Provin, Peoples R China
[4] Xuzhou Med Univ, Affiliated Hosp, Dept Med Equipment Management, Artificial Intelligence Unit, Xuzhou, Peoples R China
[5] Nanjing Med Univ, Jiangsu Prov Engn Res Ctr Smart Wearable & Rehabil, Sch Biomed Engn & Informat, Nanjing, Peoples R China
[6] Southeast Univ, Zhongda Hosp, Liver Dis Ctr Integrated Tradit Chinese & Western, Dept Radiol,Med Sch,Nurturing Ctr Jiangsu Prov Sta, Nanjing, Peoples R China
关键词
Metabolic dysfunction-associated steatotic liver disease; Demographic and clinical characteristics; Machine learning; Non-invasive screening; National health and nutrition examination survey; RISK STRATIFICATION; BIOMARKERS; FIBROSIS;
D O I
10.1186/s12967-025-06387-5
中图分类号
R-3 [医学研究方法]; R3 [基础医学];
学科分类号
1001 ;
摘要
Background Metabolic dysfunction-associated steatotic liver disease (MASLD) is a global health concern that necessitates early screening and timely intervention to improve prognosis. The current diagnostic protocols for MASLD involve complex procedures in specialised medical centres. This study aimed to explore the feasibility of utilising machine learning models to accurately screen for MASLD in large populations based on a combination of essential demographic and clinical characteristics. Methods A total of 10,007 outpatients who underwent transient elastography at the First Affiliated Hospital of Gannan Medical University were enrolled to form a derivation cohort. Using eight demographic and clinical characteristics (age, educational level, height, weight, waist and hip circumference, and history of hypertension and diabetes), we built predictive models for MASLD (classified as none or mild: controlled attenuation parameter (CAP) <= 269 dB/m; moderate: 269-296 dB/m; severe: CAP > 296 dB/m) employing 10 machine learning algorithms: logistic regression (LR), multilayer perceptron (MLP), extreme gradient boosting (XGBoost), bootstrap aggregating, decision tree, K-nearest neighbours, light gradient boosting machine, naive Bayes, random forest, and support vector machine. These models were externally validated using the National Health and Nutrition Examination Survey (NHANES) 2017-2023 datasets. Results In the hospital outpatient cohort, machine learning algorithms demonstrated robust predictive capabilities. Notably, LR achieved the highest accuracy (ACC) of 0.711 in the test cohort and 0.728 in the validation cohort, coupled with robust areas under the receiver operating characteristic curve (AUC) values of 0.798 and 0.806, respectively. Similarly, MLP and XGBoost showed promising results, with MLP achieving an ACC of 0.735 in the test cohort, and XGBoost registering an AUC of 0.798. External validation using the NHANES datasets yielded consistent AUC results, with LR (0.831), MLP (0.823), and XGBoost (0.784) performing robustly. Conclusions This study demonstrated that machine learning models constructed using a combination of essential demographic and clinical characteristics can accurately screen for MASLD in the general population. This approach significantly enhances the feasibility, accessibility, and compliance of MASLD screening and provides an effective tool for large-scale health assessments and early intervention strategies.
引用
收藏
页数:14
相关论文
共 45 条
  • [1] Community pathways for the early detection and risk stratification of chronic liver disease: a narrative systematic review
    Abeysekera, Kushala W. M.
    Macpherson, Iain
    Glyn-Owen, Kate
    McPherson, Stuart
    Parker, Richard
    Harris, Rebecca
    Yeoman, Andrew
    Rowe, Ian A.
    Dillon, John F.
    [J]. LANCET GASTROENTEROLOGY & HEPATOLOGY, 2022, 7 (08): : 770 - 780
  • [2] Prediction of diabetes disease using an ensemble of machine learning multi-classifier models
    Abnoosian, Karlo
    Farnoosh, Rahman
    Behzadi, Mohammad Hassan
    [J]. BMC BIOINFORMATICS, 2023, 24 (01)
  • [3] A prospective study on the prevalence of NAFLD, advanced fibrosis, cirrhosis and hepatocellular carcinoma in people with type 2 diabetes
    Ajmera, Veeral
    Cepin, Sandra
    Tesfai, Kaleb
    Hofflich, Heather
    Cadman, Karen
    Lopez, Scarlett
    Madamba, Egbert
    Bettencourt, Ricki
    Richards, Lisa
    Behling, Cynthia
    Sirlin, Claude B.
    Loomba, Rohit
    [J]. JOURNAL OF HEPATOLOGY, 2023, 78 (03) : 471 - 478
  • [4] Stratifying individuals into non-alcoholic fatty liver disease risk levels using time series machine learning models
    Ben-Assuli, Ofir
    Jacobi, Arie
    Goldman, Orit
    Shenhar-Tsarfaty, Shani
    Rogowski, Ori
    Zeltser, David
    Shapira, Itzhak
    Berliner, Shlomo
    Zelber-Sagi, Shira
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2022, 126
  • [5] Machine Learning-Based Biomarker Identification for Early Diagnosis of Metabolic Dysfunction-Associated Steatotic Liver Disease
    Boullion, Jolie
    Husein, Amanda
    Agrawal, Akshat
    Xing, Diensn
    Hossain, Md Ismail
    Bhuiyan, Md Shenuarin
    Rom, Oren
    Conrad, Steven A.
    Vanchiere, John A.
    Orr, A. Wayne
    Kevil, Christopher G.
    Bhuiyan, Mohammad Alfrad Nobel
    [J]. JOURNAL OF CLINICAL ENDOCRINOLOGY & METABOLISM, 2025,
  • [6] Cales Paul, 2024, J Hepatol, DOI 10.1016/j.jhep.2024.11.049
  • [7] Phenotypes of non-alcoholic fatty liver disease (NAFLD) and all-cause mortality: unsupervised machine learning analysis of NHANES III
    Carrillo-Larco, Rodrigo M.
    Cristobal Guzman-Vilca, Wilmer
    Castillo-Cara, Manuel
    Alvizuri-Gomez, Claudia
    Alqahtani, Saleh
    Garcia-Larsen, Vanessa
    [J]. BMJ OPEN, 2022, 12 (11):
  • [8] Longitudinal Outcomes Associated With Metabolic Dysfunction-Associated Steatotic Liver Disease: A Meta-analysis of 129 Studies
    Chan, Kai En
    Ong, Elden Yen Hng
    Chung, Charlotte Hui
    Ong, Christen En Ya
    Koh, Benjamin
    Tan, Darren Jun Hao
    Lim, Wen Hui
    Yong, Jie Ning
    Xiao, Jieling
    Wong, Zhen Yu
    Syn, Nicholas
    Kaewdech, Apichat
    Teng, Margaret
    Wang, Jiong-Wei
    Chew, Nicholas
    Young, Dan Yock
    Know, Alfred
    Siddiqui, Mohammad Shadab
    Huang, Daniel Q.
    Tamaki, Nobuharu
    Wong, Vincent Wai-Sun
    Mantzoros, Christos S.
    Sanyal, Arun
    Noureddin, Mazen
    Ng, Cheng Han
    Muthiah, Mark
    [J]. CLINICAL GASTROENTEROLOGY AND HEPATOLOGY, 2024, 22 (03) : 488 - 498
  • [9] Development and validation of machine learning models for MASLD: based on multiple potential screening indicators
    Chen, Hao
    Zhang, Jingjing
    Chen, Xueqin
    Luo, Ling
    Dong, Wenjiao
    Wang, Yongjie
    Zhou, Jiyu
    Chen, Canjin
    Wang, Wenhao
    Zhang, Wenbin
    Zhang, Zhiyi
    Cai, Yongguang
    Kong, Danli
    Ding, Yuanlin
    [J]. FRONTIERS IN ENDOCRINOLOGY, 2025, 15
  • [10] Non-alcoholic fatty liver disease screening in type 2 diabetes mellitus: A cost-effectiveness and price threshold analysis
    Choo, Bryan Peide
    Goh, George Boon -Bee
    Chia, Sing Yi
    Oh, Hong Choon
    Tan, Ngiap Chuan
    Tan, Jessica Yi Lyn
    Ang, Tiing Leong
    Bee, Yong Mong
    Wong, Yu Jun
    [J]. ANNALS ACADEMY OF MEDICINE SINGAPORE, 2022, 51 (11) : 686 - 694