Machine learning prediction of dioxin lipophilicity and key feature Identification

被引:0
|
作者
Wang, Yingwei [1 ]
Li, Yufei [1 ]
机构
[1] Northeast Forestry Univ, Coll Forestry, 26 Hexing Rd, Harbin 150040, Peoples R China
关键词
Dioxins; Lipophilicity; Machine learning; SHAP; Immunotoxicity; POLYCHLORINATED-BIPHENYLS PCBS; DIBENZOFURANS PCDFS; EXPOSURE; PCDDS;
D O I
10.1016/j.comptc.2024.115032
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
Dioxins are potent exogenous ligands for the aryl hydrocarbon receptor (AHR) within human cell membranes. Their lipophilicity is a critical factor influencing the immunotoxicity mediated by AHR. This study utilizes experimental data on the lipophilicity of certain PCDDs as the dependent variable, and molecular descriptors of PCDDs as independent variables, to construct five machine learning models for predicting PCDDs' lipophilicity. The evaluation metrics of these models indicate that the XGBoost model exhibits excellent predictive performance, achieving an 86% accuracy in predicting the logKow values of 75 PCDDs. An XGBoost-Bayesian stacked model was developed by employing a stacking algorithm, enhancing the prediction accuracy to 96%. This improved model was successfully applied to predict the logKow values of 175 PCDFs and validated through molecular membrane dynamics. The SHAP method identified key molecular descriptors influencing dioxin lipophilicity. This study offers a theoretical basis for investigating the toxicity of dioxins via AHR receptors.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Key feature identification of internal kink mode using machine learning
    Ning, Hongwei
    Lou, Shuyong
    Wu, Jianguo
    Zhou, Teng
    FRONTIERS IN PHYSICS, 2024, 12
  • [2] Lipophilicity prediction of peptides and peptide derivatives by consensus machine learning
    Fuchs, Jens-Alexander
    Grisoni, Francesca
    Kossenjans, Michael
    Hiss, Jan A.
    Schneider, Gisbert
    MEDCHEMCOMM, 2018, 9 (09) : 1538 - 1546
  • [3] Machine learning-based genetic feature identification and fatigue life prediction
    Zhou, Kun
    Sun, Xingyue
    Shi, Shouwen
    Song, Kai
    Chen, Xu
    FATIGUE & FRACTURE OF ENGINEERING MATERIALS & STRUCTURES, 2021, 44 (09) : 2524 - 2537
  • [4] Identification and prediction of key nucleotide sites using machine learning in Bioinformatics: A brief overview
    Cai, Jianhua
    Wei, Leyi
    Zeng, Kun
    Xiao, Guobao
    2019 IEEE INTL CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, BIG DATA & CLOUD COMPUTING, SUSTAINABLE COMPUTING & COMMUNICATIONS, SOCIAL COMPUTING & NETWORKING (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2019), 2019, : 1194 - 1200
  • [5] Identification of Individualized Feature Combinations for Survival Prediction in Breast Cancer: A Comparison of Machine Learning Techniques
    Vanneschi, Leonardo
    Farinaccio, Antonella
    Giacobini, Mario
    Mauri, Giancarlo
    Antoniotti, Marco
    Provero, Paolo
    EVOLUTIONARY COMPUTATION, MACHINE LEARNING AND DATA MINING IN BIOINFORMATICS, PROCEEDINGS, 2010, 6023 : 110 - +
  • [6] An explainable machine learning platform for pyrazinamide resistance prediction and genetic feature identification of Mycobacterium tuberculosis
    Zhang, Andrew
    Teng, Ling
    Alterovitz, Gil
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2021, 28 (03) : 533 - 540
  • [7] Feature selection and machine learning methods for optimal identification and prediction of subtypes in Parkinson's disease
    Salmanpour, R. Mohammad
    Shamsaei, Mojtaba
    Rahmim, Arman
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2021, 206
  • [8] Feature based quality prediction through machine learning
    Brecher C.
    Ochel J.
    Lohrmann V.
    Fey M.
    ZWF Zeitschrift fuer Wirtschaftlichen Fabrikbetrieb, 2019, 114 (11): : 784 - 787
  • [9] Prediction of phytoplankton biomass and identification of key influencing factors using interpretable machine learning models
    Xu, Yi
    Zhang, Di
    Lin, Junqiang
    Peng, Qidong
    Lei, Xiaohui
    Jin, Tiantian
    Wang, Jia
    Yuan, Ruifang
    ECOLOGICAL INDICATORS, 2024, 158
  • [10] Accurate prediction and key protein sequence feature identification of cyclins
    Yu, Shaoyou
    Liao, Bo
    Zhu, Wen
    Peng, Dejun
    Wu, Fangxiang
    BRIEFINGS IN FUNCTIONAL GENOMICS, 2023, 22 (05) : 411 - 419