What is the ecotoxicity of a given chemical for a given aquatic species? Predicting interactions between species and chemicals using recommender system techniques

被引:3
作者
Viljanen, M. [1 ]
Minnema, J. [2 ]
Wassenaar, P. N. H. [2 ]
Rorije, E. [2 ]
Peijnenburg, W. [2 ,3 ]
机构
[1] Natl Inst Publ Hlth & Environm, Dept Stat Data Sci & Modelling, Bilthoven, Netherlands
[2] Natl Inst Publ Hlth & Environm, Ctr Safety Subst & Prod, Bilthoven, Netherlands
[3] Leiden Univ, Inst Environm Sci CML, Leiden, Netherlands
关键词
Machine learning; modelling; aquatic toxicity; species sensitivity; prediction; LC50; QUANTITATIVE TOXICITY PREDICTION; DEEP;
D O I
10.1080/1062936X.2023.2254225
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Ecotoxicological safety assessment of chemicals requires toxicity data on multiple species, despite the general desire of minimizing animal testing. Predictive models, specifically machine learning (ML) methods, are one of the tools capable of solving this apparent contradiction as they allow to generalize toxicity patterns across chemicals and species. However, despite the availability of large public toxicity datasets, the data is highly sparse, complicating model development. The aim of this study is to provide insights into how ML can predict toxicity using a large but sparse dataset. We developed models to predict LC50-values, based on experimental LC50-data covering 2431 organic chemicals and 1506 aquatic species from the ECOTOX-database. Several well-known ML techniques were evaluated and a new ML model was developed, inspired by recommender systems. This new model involves a simple linear model that learns low-rank interactions between species and chemicals using factorization machines. We evaluated the predictive performances of the developed models based on two validation settings: 1) predicting unseen chemical-species pairs, and 2) predicting unseen chemicals. The results of this study show that ML models can accurately predict LC50-values in both validation settings. Moreover, we show that the novel factorization machine approach can match well-tuned, complex, ML approaches.
引用
收藏
页码:765 / 788
页数:24
相关论文
共 67 条
  • [1] Integrative and Personalized QSAR Analysis in Cancer by Kernelized Bayesian Matrix Factorization
    Amnnad-ud-din, Muhammad
    Georgii, Elisabeth
    Gonen, Mehmet
    Laitinen, Tuomo
    Kallioniemi, Olli
    Wennerberg, Krister
    Poso, Antti
    Kaski, Samuel
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2014, 54 (08) : 2347 - 2359
  • [2] Machine learning approaches and databases for prediction of drug-target interaction: a survey paper
    Bagherian, Maryam
    Sabeti, Elyas
    Wang, Kai
    Sartor, Maureen A.
    Nikolovska-Coleska, Zaneta
    Najarian, Kayvan
    [J]. BRIEFINGS IN BIOINFORMATICS, 2021, 22 (01) : 247 - 269
  • [3] Modeling the toxicity of chemical pesticides in multiple test species using local and global QSTR approaches
    Basant, Nikita
    Gupta, Shikha
    Singh, Kunwar P.
    [J]. TOXICOLOGY RESEARCH, 2016, 5 (01) : 340 - 353
  • [4] Artificial Intelligence for Drug Toxicity and Safety
    Basile, Anna O.
    Yahi, Alexandre
    Tatonetti, Nicholas P.
    [J]. TRENDS IN PHARMACOLOGICAL SCIENCES, 2019, 40 (09) : 624 - 635
  • [5] Basilico J., 2004, P 21 INT C MACH LEAR, DOI [10.1145/1015330.1015394, DOI 10.1145/1015330.1015394]
  • [6] Testing developmental toxicity in a second species: are the differences due to species or replication error?
    Braakhuis, Hedwig M.
    Theunissen, Peter T.
    Slob, Wout
    Rorije, Emiel
    Piersma, Aldert H.
    [J]. REGULATORY TOXICOLOGY AND PHARMACOLOGY, 2019, 107
  • [7] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [8] Multitask learning
    Caruana, R
    [J]. MACHINE LEARNING, 1997, 28 (01) : 41 - 75
  • [9] LIBSVM: A Library for Support Vector Machines
    Chang, Chih-Chung
    Lin, Chih-Jen
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
  • [10] Machine Learning for Drug-Target Interaction Prediction
    Chen, Ruolan
    Liu, Xiangrong
    Jin, Shuting
    Lin, Jiawei
    Liu, Juan
    [J]. MOLECULES, 2018, 23 (09):