Interpretable machine learning in predicting drug-induced liver injury among tuberculosis patients: model development and validation study

Cited by: 4

Authors:

Xiao, Yue [1]

Chen, Yanfei [1]

Huang, Ruijian [1]

Jiang, Feng [1]

Zhou, Jifang [1]

Yang, Tianchi [2]

Affiliations:

[1] China Pharmaceut Univ, Sch Int Pharmaceut Business, Nanjing, Jiangsu, Peoples R China

[2] Ningbo Municipal Ctr Dis Control & Prevent, Inst TB Prevent & Control, 237 Yongfeng Rd, Ningbo, Zhejiang, Peoples R China

Source:

BMC MEDICAL RESEARCH METHODOLOGY | 2024 / Vol. 24 / Issue 01

Keywords:

Machine learning; Logistic regression; Tuberculosis; Drug-induced liver injury; Retrospective study; HEALTH; HEPATOTOXICITY; GUIDELINES

DOI:

10.1186/s12874-024-02214-5

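Chinese Library Classification (CLC) number:

R19 [Health care organizations and services (health services administration)];

Discipline classification code:

Abstract: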

Background: The objective of this research was to develop and validate an interpretable prediction model for drug-induced liver injury (DILI) during tuberculosis (TB) treatment.

Methods: A dataset of TB patients from Ningbo City was used to develop models with the eXtreme Gradient Boosting (XGBoost), random forest (RF), and least absolute shrinkage and selection operator (LASSO) logistic algorithms. Model performance was evaluated with several metrics, including the area under the receiver operating characteristic curve (AUROC) and the area under the precision-recall curve (AUPR), alongside decision curve analysis. The Shapley Additive exPlanations (SHAP) method was used to interpret the variable contributions of the best-performing model.

Results: A total of 7,071 TB patients were identified from the regional healthcare dataset. The study cohort had a median age of 47 years, 68.0% were male, and 16.3% developed DILI. Part of the high-dimensional propensity score (HDPS) method was used to identify relevant variables, yielding a total of 424 variables. From these, 37 variables were selected with LASSO for inclusion in a logistic model. The dataset was then split into training and validation sets at a 7:3 ratio. In the validation dataset, the XGBoost model showed better overall performance, with an AUROC of 0.89, an AUPR of 0.75, an F1 score of 0.57, and a Brier score of 0.07. Both the SHAP analysis and the XGBoost model highlighted the contribution of baseline liver-related conditions such as DILI, drug-induced hepatitis (DIH), and fatty liver disease (FLD). Age, alanine transaminase (ALT), and total bilirubin (Tbil) were also associated with DILI status.

Conclusion: XGBoost demonstrated better predictive performance than RF and LASSO logistic regression in this study. Moreover, the SHAP method improves clinical understanding of the model and supports its potential application. Future research requires external validation and more detailed feature integration.
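
The evaluation workflow described in the abstract (a 7:3 split, LASSO-based variable selection, a boosted-tree classifier, and AUROC, AUPR, F1, and Brier score on the validation set) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the data are synthetic, scikit-learn's GradientBoostingClassifier stands in for XGBoost, the penalty strength and the 0.5 decision threshold are arbitrary, and selection is run on the training split only, which may differ from the order used in the study.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import (average_precision_score, brier_score_loss,
                             f1_score, roc_auc_score)
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in data (NOT the study cohort), with roughly 16% positives.
X, y = make_classification(n_samples=2000, n_features=40, n_informative=8,
                           weights=[0.84, 0.16], random_state=0)

# 7:3 split, stratified so both sets keep a similar outcome rate.
X_train, X_val, y_train, y_val = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0)

# L1-penalised (LASSO) logistic regression; features with non-zero
# coefficients are kept. C=0.1 is an assumed, untuned penalty strength.
scaler = StandardScaler().fit(X_train)
lasso = LogisticRegression(penalty="l1", solver="liblinear", C=0.1)
lasso.fit(scaler.transform(X_train), y_train)
selected = np.flatnonzero(lasso.coef_[0])

# Gradient-boosted trees as a stand-in for XGBoost.
gbm = GradientBoostingClassifier(random_state=0)
gbm.fit(X_train[:, selected], y_train)


def evaluate(y_true, prob, threshold=0.5):
    """Return the four validation metrics named in the abstract."""
    return {
        "auroc": roc_auc_score(y_true, prob),
        # Average precision is a common estimate of the area under the PR curve.
        "aupr": average_precision_score(y_true, prob),
        "f1": f1_score(y_true, (prob >= threshold).astype(int)),
        "brier": brier_score_loss(y_true, prob),
    }


metrics = evaluate(y_val, gbm.predict_proba(X_val[:, selected])[:, 1])
print(len(selected), metrics)
```

AUROC and AUPR are threshold-free, whereas F1 depends on the chosen cut-off. The Brier score is the mean squared error of the predicted probabilities, so lower values indicate better calibrated and more accurate predictions.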

Pages: 10
