Interpretable machine learning in predicting drug-induced liver injury among tuberculosis patients: model development and validation study

被引:4
作者
Xiao, Yue [1 ]
Chen, Yanfei [1 ]
Huang, Ruijian [1 ]
Jiang, Feng [1 ]
Zhou, Jifang [1 ]
Yang, Tianchi [2 ]
机构
[1] China Pharmaceut Univ, Sch Int Pharmaceut Business, Nanjing, Jiangsu, Peoples R China
[2] Ningbo Municipal Ctr Dis Control & Prevent, Inst TB Prevent & Control, 237 Yongfeng Rd, Ningbo, Zhejiang, Peoples R China
关键词
Machine learning; Logistic regression; Tuberculosis; Drug-induced liver injury; Retrospective study; HEALTH; HEPATOTOXICITY; GUIDELINES;
D O I
10.1186/s12874-024-02214-5
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background The objective of this research was to create and validate an interpretable prediction model for drug-induced liver injury (DILI) during tuberculosis (TB) treatment.Methods A dataset of TB patients from Ningbo City was used to develop models employing the eXtreme Gradient Boosting (XGBoost), random forest (RF), and the least absolute shrinkage and selection operator (LASSO) logistic algorithms. The model's performance was evaluated through various metrics, including the area under the receiver operating characteristic curve (AUROC) and the area under the precision recall curve (AUPR) alongside the decision curve. The Shapley Additive exPlanations (SHAP) method was used to interpret the variable contributions of the superior model.Results A total of 7,071 TB patients were identified from the regional healthcare dataset. The study cohort consisted of individuals with a median age of 47 years, 68.0% of whom were male, and 16.3% developed DILI. We utilized part of the high dimensional propensity score (HDPS) method to identify relevant variables and obtained a total of 424 variables. From these, 37 variables were selected for inclusion in a logistic model using LASSO. The dataset was then split into training and validation sets according to a 7:3 ratio. In the validation dataset, the XGBoost model displayed improved overall performance, with an AUROC of 0.89, an AUPR of 0.75, an F1 score of 0.57, and a Brier score of 0.07. Both SHAP analysis and XGBoost model highlighted the contribution of baseline liver-related ailments such as DILI, drug-induced hepatitis (DIH), and fatty liver disease (FLD). Age, alanine transaminase (ALT), and total bilirubin (Tbil) were also linked to DILI status.Conclusion XGBoost demonstrates improved predictive performance compared to RF and LASSO logistic in this study. Moreover, the introduction of the SHAP method enhances the clinical understanding and potential application of the model. For further research, external validation and more detailed feature integration are necessary.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Deep Learning for Drug-Induced Liver Injury
    Xu, Youjun
    Dai, Ziwei
    Chen, Fangjin
    Gao, Shuaishi
    Pei, Jianfeng
    Lai, Luhua
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2015, 55 (10) : 2085 - 2093
  • [22] Drug properties and host factors contribute to biochemical presentation of drug-induced liver injury: a prediction model from a machine learning approach
    Andres Gonzalez-Jimenez
    Ayako Suzuki
    Minjun Chen
    Kristin Ashby
    Ismael Alvarez-Alvarez
    Raul J. Andrade
    M. Isabel Lucena
    Archives of Toxicology, 2021, 95 : 1793 - 1803
  • [23] Drug properties and host factors contribute to biochemical presentation of drug-induced liver injury: a prediction model from a machine learning approach
    Gonzalez-Jimenez, Andres
    Suzuki, Ayako
    Chen, Minjun
    Ashby, Kristin
    Alvarez-Alvarez, Ismael
    Andrade, Raul J.
    Lucena, M. Isabel
    ARCHIVES OF TOXICOLOGY, 2021, 95 (05) : 1793 - 1803
  • [24] Prior drug allergies are associated with worse outcome in patients with idiosyncratic drug-induced liver injury: A machine learning approach for risk stratification
    Niu, Hao
    Solis-Munoz, Pablo
    Garcia-Cortes, Miren
    Sanabria-Cabrera, Judith
    Robles-Diaz, Mercedes
    Romero-Flores, Rocio
    Bonilla-Toyos, Elvira
    Ortega-Alonso, Aida
    Pinazo-Bandera, Jose M.
    Cabello, Maria R.
    Bessone, Fernando
    Hernandez, Nelia
    Lucena, M. Isabel
    Andrade, Raul J.
    Medina-Caliz, Inmaculada
    Alvarez-Alvarez, Ismael
    PHARMACOLOGICAL RESEARCH, 2024, 199
  • [25] Drug-Induced Liver Injury from Anti-Tuberculosis Treatment: A Retrospective Cohort Study
    Zhao, Hong
    Wang, Yanbing
    Zhang, Ting
    Wang, Qi
    Xie, Wen
    MEDICAL SCIENCE MONITOR, 2020, 26
  • [26] Machine Learning to Identify Interaction of Single-Nucleotide Polymorphisms as a Risk Factor for Chronic Drug-Induced Liver Injury
    Moore, Roland
    Ashby, Kristin
    Liao, Tsung-Jen
    Chen, Minjun
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (20)
  • [27] Interpretable machine learning model for predicting acute kidney injury in critically ill patients
    Li, Xunliang
    Wang, Peng
    Zhu, Yuke
    Zhao, Wenman
    Pan, Haifeng
    Wang, Deguang
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2024, 24 (01)
  • [28] Anti-Tuberculosis Drug-Induced Liver Injury in Shanghai: Validation of Hy's Law
    Shen, Xin
    Yuan, Zheng'an
    Mei, Jian
    Zhang, Zurong
    Guo, Juntao
    Wu, Zheyuan
    Wu, Jie
    Zhang, Haihua
    Pan, Jieping
    Huang, Wenming
    Gong, Huili
    Yuan, Dong
    Xiao, Ping
    Wang, Yanqin
    Shuai, Yi
    Lin, Senlin
    Pan, Qichao
    Zhou, Tong
    Watkins, Paul B.
    Wu, Fan
    DRUG SAFETY, 2014, 37 (01) : 43 - 51
  • [29] Zebrafish as model organisms for studying drug-induced liver injury
    Vliegenthart, A. D. Bastiaan
    Tucker, Carl S.
    Del Pozo, Jorge
    Dear, James W.
    BRITISH JOURNAL OF CLINICAL PHARMACOLOGY, 2014, 78 (06) : 1217 - 1227
  • [30] DILI-Stk: An Ensemble Model for the Prediction of Drug-induced Liver Injury of Drug Candidates
    Lee, Jingyu
    Yu, Myeong-sang
    Na, Dokyun
    CURRENT BIOINFORMATICS, 2022, 17 (03) : 296 - 303