Interpretable machine learning in predicting drug-induced liver injury among tuberculosis patients: model development and validation study

被引:4
作者
Xiao, Yue [1 ]
Chen, Yanfei [1 ]
Huang, Ruijian [1 ]
Jiang, Feng [1 ]
Zhou, Jifang [1 ]
Yang, Tianchi [2 ]
机构
[1] China Pharmaceut Univ, Sch Int Pharmaceut Business, Nanjing, Jiangsu, Peoples R China
[2] Ningbo Municipal Ctr Dis Control & Prevent, Inst TB Prevent & Control, 237 Yongfeng Rd, Ningbo, Zhejiang, Peoples R China
关键词
Machine learning; Logistic regression; Tuberculosis; Drug-induced liver injury; Retrospective study; HEALTH; HEPATOTOXICITY; GUIDELINES;
D O I
10.1186/s12874-024-02214-5
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background The objective of this research was to create and validate an interpretable prediction model for drug-induced liver injury (DILI) during tuberculosis (TB) treatment.Methods A dataset of TB patients from Ningbo City was used to develop models employing the eXtreme Gradient Boosting (XGBoost), random forest (RF), and the least absolute shrinkage and selection operator (LASSO) logistic algorithms. The model's performance was evaluated through various metrics, including the area under the receiver operating characteristic curve (AUROC) and the area under the precision recall curve (AUPR) alongside the decision curve. The Shapley Additive exPlanations (SHAP) method was used to interpret the variable contributions of the superior model.Results A total of 7,071 TB patients were identified from the regional healthcare dataset. The study cohort consisted of individuals with a median age of 47 years, 68.0% of whom were male, and 16.3% developed DILI. We utilized part of the high dimensional propensity score (HDPS) method to identify relevant variables and obtained a total of 424 variables. From these, 37 variables were selected for inclusion in a logistic model using LASSO. The dataset was then split into training and validation sets according to a 7:3 ratio. In the validation dataset, the XGBoost model displayed improved overall performance, with an AUROC of 0.89, an AUPR of 0.75, an F1 score of 0.57, and a Brier score of 0.07. Both SHAP analysis and XGBoost model highlighted the contribution of baseline liver-related ailments such as DILI, drug-induced hepatitis (DIH), and fatty liver disease (FLD). Age, alanine transaminase (ALT), and total bilirubin (Tbil) were also linked to DILI status.Conclusion XGBoost demonstrates improved predictive performance compared to RF and LASSO logistic in this study. Moreover, the introduction of the SHAP method enhances the clinical understanding and potential application of the model. For further research, external validation and more detailed feature integration are necessary.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] AMALPHI: A Machine Learning Platform for Predicting Drug-Induced PhospholIpidosis
    Lomuscio, Maria Cristina
    Abate, Carmen
    Alberga, Domenico
    Laghezza, Antonio
    Corriero, Nicola
    Colabufo, Nicola Antonio
    Saviano, Michele
    Delre, Pietro
    Mangiatordi, Giuseppe Felice
    MOLECULAR PHARMACEUTICS, 2023, 21 (02) : 864 - 872
  • [42] A nomogram model to predict the risk of drug-induced liver injury in patients receiving anti-tuberculosis treatment
    Ji, Songjun
    Lu, Bin
    Pan, Xinling
    FRONTIERS IN PHARMACOLOGY, 2023, 14
  • [43] Development and validation of an interpretable machine learning model for predicting the risk of hepatocellular carcinoma in patients with chronic hepatitis B: a case-control study
    Wu, Linghong
    Liu, Zengjing
    Huang, Hongyuan
    Pan, Dongmei
    Fu, Cuiping
    Lu, Yao
    Zhou, Min
    Huang, Kaiyong
    Huang, Tianren
    Yang, Li
    BMC GASTROENTEROLOGY, 2025, 25 (01)
  • [44] Treatment outcomes among patients admitted to hospital with antiretroviral and/or antituberculosis drug-induced liver injury
    Mehta, R.
    Ive, P.
    Evans, D.
    Menezes, C. N.
    SAMJ SOUTH AFRICAN MEDICAL JOURNAL, 2021, 111 (05): : 474 - 481
  • [45] Drug-Induced Liver Injury in Patients With Chronic Liver Disease
    Ghabril, Marwan
    Vuppalanchi, Raj
    Chalasani, Naga
    LIVER INTERNATIONAL, 2025, 45 (03)
  • [46] Identification of Drug-Induced Liver Injury Biomarkers from Multiple Microarrays Based on Machine Learning and Bioinformatics Analysis
    Wang, Kaiyue
    Zhang, Lin
    Li, Lixia
    Wang, Yi
    Zhong, Xinqin
    Hou, Chunyu
    Zhang, Yuqi
    Sun, Congying
    Zhou, Qian
    Wang, Xiaoying
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2022, 23 (19)
  • [47] Improved prediction of drug-induced liver injury literature using natural language processing and machine learning methods
    Oh, Jung Hun
    Tannenbaum, Allen
    Deasy, Joseph O.
    FRONTIERS IN GENETICS, 2023, 14
  • [48] Prediction of Drug-Induced Liver Injury: From Molecular Physicochemical Properties and Scaffold Architectures to Machine Learning Approaches
    Zhao, Yulong
    Zhang, Zhoudong
    Kong, Xiaotian
    Wang, Kai
    Wang, Yaxuan
    Jia, Jie
    Li, Huanqiu
    Tian, Sheng
    CHEMICAL BIOLOGY & DRUG DESIGN, 2024, 104 (02)
  • [49] InterDILI: interpretable prediction of drug-induced liver injury through permutation feature importance and attention mechanism
    Soyeon Lee
    Sunyong Yoo
    Journal of Cheminformatics, 16
  • [50] InterDILI: interpretable prediction of drug-induced liver injury through permutation feature importance and attention mechanism
    Lee, Soyeon
    Yoo, Sunyong
    JOURNAL OF CHEMINFORMATICS, 2024, 16 (01)