Machine Learning Models for Identification and Prediction of Toxic Organic Compounds Using Daphnia magna Transcriptomic Profiles

Cited by: 6
Authors
Choi, Tae-June [1]
An, Hyung-Eun [1]
Kim, Chang-Bae [1]
Affiliations
[1] Sangmyung Univ, Dept Biotechnol, Seoul 03016, South Korea
Source
LIFE-BASEL | 2022 / Vol. 12 / Issue 09
Keywords
environmental monitoring; aquatic ecosystem; toxic organic compounds; Daphnia magna; transcriptomic profiles; machine learning; random forest; CLASSIFICATION
DOI
10.3390/life12091443
Chinese Library Classification (CLC) number
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
A wide range of environmental factors heavily impact aquatic ecosystems, in turn affecting human health. Toxic organic compounds resulting from anthropogenic activity are a source of pollution in aquatic ecosystems. Current approaches to evaluating these contaminants rely mainly on acute and chronic toxicity tests, which cannot provide explicit insights into the causes of toxicity. As an alternative, genome-wide gene expression systems allow the identification of contaminants causing toxicity by monitoring the organisms' response to toxic substances. In this study, we selected 22 toxic organic compounds, classified as pesticides, herbicides, or industrial chemicals, that induce environmental problems in aquatic ecosystems and affect human health. To identify toxic organic compounds using gene expression data from Daphnia magna, we evaluated the performance of three machine-learning-based feature-ranking algorithms (Learning Vector Quantization, Random Forest, and Support Vector Machines with a Linear kernel) and nine classifiers (Linear Discriminant Analysis, Classification And Regression Trees, K-nearest neighbors, Support Vector Machines with a Linear kernel, Random Forest, Boosted C5.0, Gradient Boosting Machine, eXtreme Gradient Boosting with tree, and eXtreme Gradient Boosting with DART booster). Our analysis revealed that a combination of feature selection based on feature ranking and a random forest classification algorithm had the best model performance, with an accuracy of 95.7%. This is a preliminary study to establish a model for monitoring aquatic toxic substances by machine learning. This model could be an effective tool to manage contaminants and toxic organic compounds in aquatic systems.
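The combination the abstract reports as best (feature selection by ranking, followed by a random forest classifier) can be illustrated with a minimal sketch. This is not the authors' code or data: it assumes Python with scikit-learn, uses a synthetic matrix as a stand-in for expression profiles, ranks features by random forest importance only, and the cutoff `TOP_K`, the class count, and the fold count are hypothetical choices made for illustration.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import SelectFromModel
from sklearn.model_selection import StratifiedKFold, cross_val_score
from sklearn.pipeline import Pipeline

# Synthetic stand-in for a samples-by-genes expression matrix (NOT the study's data).
X, y = make_classification(
    n_samples=120, n_features=200, n_informative=15, n_redundant=0,
    n_classes=3, n_clusters_per_class=1, random_state=0,
)

TOP_K = 20  # hypothetical number of top-ranked features to keep

pipeline = Pipeline([
    # Step 1: rank features by random forest importance and keep the TOP_K best.
    # threshold=-inf disables the importance cutoff so only max_features applies.
    ("rank", SelectFromModel(
        RandomForestClassifier(n_estimators=200, random_state=0),
        max_features=TOP_K, threshold=-np.inf)),
    # Step 2: classify samples using only the selected features.
    ("clf", RandomForestClassifier(n_estimators=200, random_state=0)),
])

# Selection sits inside the pipeline, so it is refit on each training fold
# and the held-out fold never influences which features are chosen.
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
scores = cross_val_score(pipeline, X, y, cv=cv, scoring="accuracy")
mean_accuracy = scores.mean()
print(f"mean cross-validated accuracy: {mean_accuracy:.3f}")

# Fit once on all data to inspect how many features the ranking step kept.
pipeline.fit(X, y)
n_selected = int(pipeline.named_steps["rank"].get_support().sum())
print(f"features kept: {n_selected}")
```

Keeping the ranking step inside the cross-validated pipeline is a deliberate design choice in this sketch: ranking features on the full dataset before splitting would tend to inflate the estimated accuracy.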
Pages: 10