Machine learning models for predicting metabolic dysfunction-associated steatotic liver disease prevalence using basic demographic and clinical characteristics

被引:2
作者
Zhu, Gangfeng [1 ]
Song, Yipeng [1 ]
Lu, Zenghong [2 ]
Yi, Qiang [1 ]
Xu, Rui [3 ]
Xie, Yi [1 ]
Geng, Shi [4 ]
Yang, Na [4 ,5 ]
Zheng, Liangjian [1 ]
Feng, Xiaofei [2 ]
Zhu, Rui [2 ]
Wang, Xiangcai [2 ]
Huang, Li [2 ]
Xiang, Yi [2 ,6 ]
机构
[1] Gannan Med Univ, Clin Med Coll 1, Ganzhou 341000, Jiangxi Provinc, Peoples R China
[2] Gannan Med Univ, Affiliated Hosp 1, Jiangxi Clin Res Ctr Canc, Dept Oncol, Ganzhou 341000, Jiangxi Provinc, Peoples R China
[3] Zhejiang Univ, Affiliated Jinhua Hosp, Dept Rehabil Med, Sch Med, Jinhua 321000, Zhejiang Provin, Peoples R China
[4] Xuzhou Med Univ, Affiliated Hosp, Dept Med Equipment Management, Artificial Intelligence Unit, Xuzhou, Peoples R China
[5] Nanjing Med Univ, Jiangsu Prov Engn Res Ctr Smart Wearable & Rehabil, Sch Biomed Engn & Informat, Nanjing, Peoples R China
[6] Southeast Univ, Zhongda Hosp, Liver Dis Ctr Integrated Tradit Chinese & Western, Dept Radiol,Med Sch,Nurturing Ctr Jiangsu Prov Sta, Nanjing, Peoples R China
关键词
Metabolic dysfunction-associated steatotic liver disease; Demographic and clinical characteristics; Machine learning; Non-invasive screening; National health and nutrition examination survey; RISK STRATIFICATION; BIOMARKERS; FIBROSIS;
D O I
10.1186/s12967-025-06387-5
中图分类号
R-3 [医学研究方法]; R3 [基础医学];
学科分类号
1001 ;
摘要
Background Metabolic dysfunction-associated steatotic liver disease (MASLD) is a global health concern that necessitates early screening and timely intervention to improve prognosis. The current diagnostic protocols for MASLD involve complex procedures in specialised medical centres. This study aimed to explore the feasibility of utilising machine learning models to accurately screen for MASLD in large populations based on a combination of essential demographic and clinical characteristics. Methods A total of 10,007 outpatients who underwent transient elastography at the First Affiliated Hospital of Gannan Medical University were enrolled to form a derivation cohort. Using eight demographic and clinical characteristics (age, educational level, height, weight, waist and hip circumference, and history of hypertension and diabetes), we built predictive models for MASLD (classified as none or mild: controlled attenuation parameter (CAP) <= 269 dB/m; moderate: 269-296 dB/m; severe: CAP > 296 dB/m) employing 10 machine learning algorithms: logistic regression (LR), multilayer perceptron (MLP), extreme gradient boosting (XGBoost), bootstrap aggregating, decision tree, K-nearest neighbours, light gradient boosting machine, naive Bayes, random forest, and support vector machine. These models were externally validated using the National Health and Nutrition Examination Survey (NHANES) 2017-2023 datasets. Results In the hospital outpatient cohort, machine learning algorithms demonstrated robust predictive capabilities. Notably, LR achieved the highest accuracy (ACC) of 0.711 in the test cohort and 0.728 in the validation cohort, coupled with robust areas under the receiver operating characteristic curve (AUC) values of 0.798 and 0.806, respectively. Similarly, MLP and XGBoost showed promising results, with MLP achieving an ACC of 0.735 in the test cohort, and XGBoost registering an AUC of 0.798. External validation using the NHANES datasets yielded consistent AUC results, with LR (0.831), MLP (0.823), and XGBoost (0.784) performing robustly. Conclusions This study demonstrated that machine learning models constructed using a combination of essential demographic and clinical characteristics can accurately screen for MASLD in the general population. This approach significantly enhances the feasibility, accessibility, and compliance of MASLD screening and provides an effective tool for large-scale health assessments and early intervention strategies.
引用
收藏
页数:14
相关论文
共 45 条
[1]   Community pathways for the early detection and risk stratification of chronic liver disease: a narrative systematic review [J].
Abeysekera, Kushala W. M. ;
Macpherson, Iain ;
Glyn-Owen, Kate ;
McPherson, Stuart ;
Parker, Richard ;
Harris, Rebecca ;
Yeoman, Andrew ;
Rowe, Ian A. ;
Dillon, John F. .
LANCET GASTROENTEROLOGY & HEPATOLOGY, 2022, 7 (08) :770-780
[2]   Prediction of diabetes disease using an ensemble of machine learning multi-classifier models [J].
Abnoosian, Karlo ;
Farnoosh, Rahman ;
Behzadi, Mohammad Hassan .
BMC BIOINFORMATICS, 2023, 24 (01)
[3]   A prospective study on the prevalence of NAFLD, advanced fibrosis, cirrhosis and hepatocellular carcinoma in people with type 2 diabetes [J].
Ajmera, Veeral ;
Cepin, Sandra ;
Tesfai, Kaleb ;
Hofflich, Heather ;
Cadman, Karen ;
Lopez, Scarlett ;
Madamba, Egbert ;
Bettencourt, Ricki ;
Richards, Lisa ;
Behling, Cynthia ;
Sirlin, Claude B. ;
Loomba, Rohit .
JOURNAL OF HEPATOLOGY, 2023, 78 (03) :471-478
[4]   Stratifying individuals into non-alcoholic fatty liver disease risk levels using time series machine learning models [J].
Ben-Assuli, Ofir ;
Jacobi, Arie ;
Goldman, Orit ;
Shenhar-Tsarfaty, Shani ;
Rogowski, Ori ;
Zeltser, David ;
Shapira, Itzhak ;
Berliner, Shlomo ;
Zelber-Sagi, Shira .
JOURNAL OF BIOMEDICAL INFORMATICS, 2022, 126
[5]   Machine Learning-Based Biomarker Identification for Early Diagnosis of Metabolic Dysfunction-Associated Steatotic Liver Disease [J].
Boullion, Jolie ;
Husein, Amanda ;
Agrawal, Akshat ;
Xing, Diensn ;
Hossain, Md Ismail ;
Bhuiyan, Md Shenuarin ;
Rom, Oren ;
Conrad, Steven A. ;
Vanchiere, John A. ;
Orr, A. Wayne ;
Kevil, Christopher G. ;
Bhuiyan, Mohammad Alfrad Nobel .
JOURNAL OF CLINICAL ENDOCRINOLOGY & METABOLISM, 2025,
[6]   A new generation of non-invasive tests of liver fibrosis with improved accuracy in MASLD [J].
Cales, Paul ;
Canivet, Clemence M. ;
Costentin, Charlotte ;
Lannes, Adrien ;
Oberti, Frederic ;
Fouchard, Isabelle ;
Hunault, Gilles ;
de Ledinghen, Victor ;
Boursier, Jerome .
JOURNAL OF HEPATOLOGY, 2025, 82 (05) :794-804
[7]   Phenotypes of non-alcoholic fatty liver disease (NAFLD) and all-cause mortality: unsupervised machine learning analysis of NHANES III [J].
Carrillo-Larco, Rodrigo M. ;
Cristobal Guzman-Vilca, Wilmer ;
Castillo-Cara, Manuel ;
Alvizuri-Gomez, Claudia ;
Alqahtani, Saleh ;
Garcia-Larsen, Vanessa .
BMJ OPEN, 2022, 12 (11)
[8]   Longitudinal Outcomes Associated With Metabolic Dysfunction-Associated Steatotic Liver Disease: A Meta-analysis of 129 Studies [J].
Chan, Kai En ;
Ong, Elden Yen Hng ;
Chung, Charlotte Hui ;
Ong, Christen En Ya ;
Koh, Benjamin ;
Tan, Darren Jun Hao ;
Lim, Wen Hui ;
Yong, Jie Ning ;
Xiao, Jieling ;
Wong, Zhen Yu ;
Syn, Nicholas ;
Kaewdech, Apichat ;
Teng, Margaret ;
Wang, Jiong-Wei ;
Chew, Nicholas ;
Young, Dan Yock ;
Know, Alfred ;
Siddiqui, Mohammad Shadab ;
Huang, Daniel Q. ;
Tamaki, Nobuharu ;
Wong, Vincent Wai-Sun ;
Mantzoros, Christos S. ;
Sanyal, Arun ;
Noureddin, Mazen ;
Ng, Cheng Han ;
Muthiah, Mark .
CLINICAL GASTROENTEROLOGY AND HEPATOLOGY, 2024, 22 (03) :488-498
[9]   Development and validation of machine learning models for MASLD: based on multiple potential screening indicators [J].
Chen, Hao ;
Zhang, Jingjing ;
Chen, Xueqin ;
Luo, Ling ;
Dong, Wenjiao ;
Wang, Yongjie ;
Zhou, Jiyu ;
Chen, Canjin ;
Wang, Wenhao ;
Zhang, Wenbin ;
Zhang, Zhiyi ;
Cai, Yongguang ;
Kong, Danli ;
Ding, Yuanlin .
FRONTIERS IN ENDOCRINOLOGY, 2025, 15
[10]   Non-alcoholic fatty liver disease screening in type 2 diabetes mellitus: A cost-effectiveness and price threshold analysis [J].
Choo, Bryan Peide ;
Goh, George Boon -Bee ;
Chia, Sing Yi ;
Oh, Hong Choon ;
Tan, Ngiap Chuan ;
Tan, Jessica Yi Lyn ;
Ang, Tiing Leong ;
Bee, Yong Mong ;
Wong, Yu Jun .
ANNALS ACADEMY OF MEDICINE SINGAPORE, 2022, 51 (11) :686-694