Prediction of Compounds Activity in Nuclear Receptor Signaling and Stress Pathway Assays Using Machine Learning Algorithms and Low-Dimensional Molecular Descriptors

被引:26
作者
Stefaniak, Flip [1 ]
机构
[1] Int Inst Mol & Cell Biol Warsaw, Lab Bioinformat & Prot Engn, Warsaw, Poland
基金
欧洲研究理事会;
关键词
toxicity prediction; machine learning; molecular descriptors; molecular fingerprints; Tox21 Data Challenge 2014;
D O I
10.3389/fenvs.2015.00077
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Toxicity evaluation of newly synthesized or used compounds is one of the main challenges during product development in many areas of industry. For example, toxicity is the second reason after lack of efficacy for failure in preclinical and clinical studies of drug candidates. To avoid attrition at the late stage of the drug development process, the toxicity analyses are employed at the early stages of a discovery pipeline, along with activity and selectivity enhancing. Although many assays for screening its vitro toxicity are available, their massive application is not always time and cost effective. Thus, the need for fast and reliable its silico tools, which can be used not only for toxicity prediction of existing compounds, but also for prioritization of compounds planned for synthesis or acquisition. Here I present the benchmark results of the combination of various attribute selection methods and machine learning algorithms and their application to the data sets of the Tox21 Data Challenge. The best performing method: Best First for attribute selection with the Rotation Forest/ADTree classifier offers good accuracy for most tested cases. For 11 out of 12 targets, the AUROC value for the final evaluation set was =0.72, while for three targets the AUROC value was = 0.80, with the average AUROC being 0.784 +/- 0.069. The use of two-dimensional descriptors sets enables fast screening and compound prioritization even for a very large database. Open source tools used in this project make the presented approach widely available and encourage the community to further improve the presented scheme.
引用
收藏
页数:7
相关论文
共 38 条
[1]   Comparison of In Silico Models for Prediction of Mutagenicity [J].
Bakhtyari, Nazanin G. ;
Raitano, Giuseppa ;
Benfenati, Emilio ;
Martin, Todd ;
Young, Douglas .
JOURNAL OF ENVIRONMENTAL SCIENCE AND HEALTH PART C-ENVIRONMENTAL CARCINOGENESIS & ECOTOXICOLOGY REVIEWS, 2013, 31 (01) :45-66
[2]   KNIME-CDK: Workflow-driven cheminformatics [J].
Beisken, Stephan ;
Meinl, Thorsten ;
Wiswedel, Bernd ;
de Figueiredo, Luis F. ;
Berthold, Michael ;
Steinbeck, Christoph .
BMC BIOINFORMATICS, 2013, 14
[3]   KNIME:: The Konstanz Information Miner [J].
Berthold, Michael R. ;
Cebron, Nicolas ;
Dill, Fabian ;
Gabriel, Thomas R. ;
Koetter, Tobias ;
Meinl, Thorsten ;
Ohl, Peter ;
Sieb, Christoph ;
Thiel, Kilian ;
Wiswedel, Bernd .
DATA ANALYSIS, MACHINE LEARNING AND APPLICATIONS, 2008, :319-326
[4]   Towards Global QSAR Model Building for Acute Toxicity: Munro Database Case Study [J].
Chavan, Swapnil ;
Nicholls, Ian A. ;
Karlsson, Bjorn C. G. ;
Rosengren, Annika M. ;
Ballabio, Davide ;
Consonni, Viviana ;
Todeschini, Roberto .
INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2014, 15 (10) :18162-18174
[5]  
Cheng FX, 2013, CURR TOP MED CHEM, V13, P1273
[6]   A PRACTICAL IN-HOME ILLUMINATION CONSIDERATION TO REDUCE DATA RATE FLUCTUATION IN VISIBLE LIGHT COMMUNICATION [J].
Chow, C. W. ;
Liu, Y. ;
Yeh, C. H. ;
Sung, J. Y. ;
Liu, Y. L. .
IEEE WIRELESS COMMUNICATIONS, 2015, 22 (02) :17-23
[7]   In Silico Quantitative Structure Toxicity Relationship of Chemical Compounds: Some Case Studies [J].
Deeb, Omar ;
Goodarzi, Mohammad .
CURRENT DRUG SAFETY, 2012, 7 (04) :289-297
[8]   Molecular similarity-based predictions of the Tox21 screening outcome [J].
Drwal, Malgorzata N. ;
Siramshetty, Vishal B. ;
Banerjee, Priyanka ;
Goede, Andrean ;
Preissner, Robert ;
Dunkel, Mathias .
FRONTIERS IN ENVIRONMENTAL SCIENCE, 2015, 3
[9]   Machine-Learning Techniques Applied to Antibacterial Drug Discovery [J].
Durrant, Jacob D. ;
Amaro, Rommie E. .
CHEMICAL BIOLOGY & DRUG DESIGN, 2015, 85 (01) :14-21
[10]  
Erickson BE, 2011, CHEM ENG NEWS, V89, P25