Machine Learning Algorithms for Predicting Fatty Liver Disease

被引:14
作者
Pei, Xieyi [1 ,2 ,3 ,4 ]
Deng, Qingqing [5 ,6 ]
Liu, Zhuo [1 ,4 ]
Yan, Xiang [1 ,2 ,3 ]
Sun, Weiping [5 ,6 ]
机构
[1] Cent South Univ, Dept Metab & Endocrinol, Xiangya Hosp 2, Changsha, Peoples R China
[2] Natl Clin Res Ctr Metab Dis, Changsha, Peoples R China
[3] Cent South Univ, Key Lab Diabet Immunol, Minist Educ, Changsha, Peoples R China
[4] Loudi Cent Hosp, Dept Metab & Endocrinol, Loudi, Peoples R China
[5] First Peoples Hosp Xiangtan City, Dept Geriatr, Xiangtan, Peoples R China
[6] Jishou Univ, Dept Clin Med, Sch Med, Jishou, Peoples R China
基金
国家重点研发计划;
关键词
Fatty liver disease; Machine learning; Classification model; Extreme gradient boosting; RISK;
D O I
10.1159/000513654
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background: Fatty liver disease (FLD) has become a rampant condition. It is associated with a high rate of morbidity and mortality in a population. The condition is commonly referred as FLD. Early prediction of FLD would allow patients to take necessary preventive, diagnosis, and treatment. The main objective of this research is to develop a machine learning (ML) model to predict FLD that can help medics to classify individuals at high risk of FLD, make novel diagnosis, management, and prevention for FLD. Methods: Total of 3,419 subjects were recruited with 845 having been screened for FLD. Classification models were used in the detection of the disease. These models include logistic regression (LR), random forest (RF), artificial neural networks (ANNs), k-nearest neighbors (KNNs), extreme gradient boosting (XGBoost), and linear discriminant analysis (LDA). Predictive accuracy was assessed by area under curve (AUC), sensitivity, specificity, positive predictive value, and negative predictive value. Results: We demonstrated that ML models give more accurate predictions, the best accuracy reached to 0.9415 in the XGBoost model. Feature importance analysis not only confirmed some well-known FLD risk factors, but also demonstrated several novel features for predicting the risk of FLD, such as hemoglobin. Conclusion: By implementing the XGBoost model, physicians can efficiently identify FLD in general patients; this would help in prevention, early treatment, and management of FLD.
引用
收藏
页码:38 / 45
页数:8
相关论文
共 13 条
[1]   Classifier Approaches for Liver Steatosis using Ultrasound Images [J].
Andrade, Andreia ;
Silva, Jose Silvestre ;
Santos, Jaime ;
Belo-Soares, Pedro .
4TH CONFERENCE OF ENTERPRISE INFORMATION SYSTEMS - ALIGNING TECHNOLOGY, ORGANIZATIONS AND PEOPLE (CENTERIS 2012), 2012, 5 :763-770
[2]   Progression of NAFLD to diabetes mellitus, cardiovascular disease or cirrhosis [J].
Anstee, Quentin M. ;
Targher, Giovanni ;
Day, Christopher P. .
NATURE REVIEWS GASTROENTEROLOGY & HEPATOLOGY, 2013, 10 (06) :330-344
[3]   The epidemiology of non-alcoholic fatty liver disease [J].
Bellentani, Stefano .
LIVER INTERNATIONAL, 2017, 37 :81-84
[5]   Novel solutions for an old disease: Diagnosis of acute appendicitis with random forest, support vector machines, and artificial neural networks [J].
Hsieh, Chung-Ho ;
Lu, Ruey-Hwa ;
Lee, Nai-Hsin ;
Chiu, Wen-Ta ;
Hsu, Min-Huei ;
Li, Yu-Chuan .
SURGERY, 2011, 149 (01) :87-93
[6]   Prevalence of non-alcoholic fatty liver disease and risk factors for advanced fibrosis and mortality in the United States [J].
Le, Michael H. ;
Devaki, Pardha ;
Ha, Nghiem B. ;
Jun, Dae Won ;
Te, Helen S. ;
Cheung, Ramsey C. ;
Nguyen, Mindie H. .
PLOS ONE, 2017, 12 (03)
[7]   Computer Aided Diagnosis of Fatty Liver Ultrasonic Images Based on Support Vector Machine [J].
Li, Guokuan ;
Luo, Yu ;
Deng, Wei ;
Xu, Xiangyang ;
Liu, Aihua ;
Song, Enmin .
2008 30TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-8, 2008, :4768-+
[8]   Risk factors and predictors of non-alcoholic fatty liver disease in Taiwan [J].
Lin, Ying-Chin ;
Chou, Shou-Chu ;
Huang, Pai-Tsang ;
Chiou, Hung-Yi .
ANNALS OF HEPATOLOGY, 2011, 10 (02) :125-132
[9]   Body Mass Index and Risk of Nonalcoholic Fatty Liver Disease: Two Electronic Health Record Prospective Studies [J].
Loomis, A. Katrina ;
Kabadi, Shaum ;
Preiss, David ;
Hyde, Craig ;
Bonato, Vinicius ;
St Louis, Matthew ;
Desai, Jigar ;
Gill, Jason M. R. ;
Welsh, Paul ;
Waterworth, Dawn ;
Sattar, Naveed .
JOURNAL OF CLINICAL ENDOCRINOLOGY & METABOLISM, 2016, 101 (03) :945-952
[10]   Central obesity and nonalcoholic fatty liver disease risk after adjusting for body mass index [J].
Pang, Qing ;
Zhang, Jing-Yao ;
Song, Si-Dong ;
Qu, Kai ;
Xu, Xin-Sen ;
Liu, Su-Shun ;
Liu, Chang .
WORLD JOURNAL OF GASTROENTEROLOGY, 2015, 21 (05) :1650-1662