Evaluation of Different Methods for Identification of Structural Alerts Using Chemical Ames Mutagenicity Data Set as a Benchmark

Cited by: 54
Authors
Yang, Hongbin [1]
Li, Jie [1]
Wu, Zengrui [1]
Li, Weihua [1]
Liu, Guixia [1]
Tang, Yun [1]
Affiliations
[1] East China Univ Sci & Technol, Sch Pharm, Shanghai Key Lab New Drug Design, Shanghai 200237, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
IN-SILICO PREDICTION; SALMONELLA MUTAGENICITY; TOXICITY; CARCINOGENICITY; DERIVATION; SUBSTRUCTURES; VALIDATION; DISCOVERY; LIBRARY; BINDING;
DOI
10.1021/acs.chemrestox.7b00083
Chinese Library Classification
R914 [Medicinal Chemistry];
Discipline classification code
100701;
Abstract
Identification of structural alerts for toxicity is useful in drug discovery and in other fields such as environmental protection. With structural alerts, researchers can quickly identify potentially toxic compounds and learn how to modify them. Hence, it is important to determine structural alerts from a large number of compounds quickly and accurately. Many methods have already been reported for identification of structural alerts. However, how to evaluate those methods remains a problem. In this paper, we tried to evaluate four methods for monosubstructure identification with three indices (accuracy rate, coverage rate, and information gain) to compare their advantages and disadvantages. The Kazius Ames mutagenicity data set was used as the benchmark, and the four methods were MoSS (graph-based), SARpy (fragment-based), and two fingerprint-based methods: Bioalerts and the fingerprint (FP) method we previously used. The results showed that Bioalerts and FP could detect key substructures with high accuracy and coverage rates because they allowed unclosed rings and wildcard atom or bond types. However, they also produced redundant substructures, so their predictive performance was not as good as that of SARpy. SARpy was competitive in predictive performance on both the training set and the external validation set. These results might help users select appropriate methods and might support further development of methods for identification of structural alerts.
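The three indices named in the abstract can be illustrated with a small sketch. The abstract does not give their formulas, so the definitions below are assumptions: accuracy rate is taken as the fraction of alert-containing compounds that are mutagenic, coverage rate as the fraction of mutagenic compounds that contain the alert, and information gain as the reduction in label entropy after splitting the data set on presence of the alert. The function names `entropy` and `alert_indices` are hypothetical and are not taken from the paper or from any of the four tools.

```python
import math


def entropy(pos, neg):
    """Shannon entropy (in bits) of a two-class split with the given counts."""
    total = pos + neg
    if total == 0:
        return 0.0
    h = 0.0
    for count in (pos, neg):
        if count:
            p = count / total
            h -= p * math.log2(p)
    return h


def alert_indices(labels, hits):
    """Return (accuracy_rate, coverage_rate, information_gain) for one alert.

    labels: 1 if the compound is mutagenic, 0 otherwise.
    hits:   1 if the compound contains the candidate substructure, 0 otherwise.
    The three definitions are assumed, not quoted from the paper.
    """
    n = len(labels)
    if n == 0 or n != len(hits):
        raise ValueError("labels and hits must be non-empty and equally long")

    tp = sum(1 for y, h in zip(labels, hits) if y and h)
    fp = sum(1 for y, h in zip(labels, hits) if not y and h)
    fn = sum(1 for y, h in zip(labels, hits) if y and not h)
    tn = n - tp - fp - fn

    matched = tp + fp        # compounds containing the alert
    unmatched = fn + tn      # compounds without the alert
    positives = tp + fn      # all mutagenic compounds

    accuracy_rate = tp / matched if matched else 0.0
    coverage_rate = tp / positives if positives else 0.0

    # Entropy of the labels before and after splitting on the alert.
    h_before = entropy(positives, fp + tn)
    h_after = (matched / n) * entropy(tp, fp) + (unmatched / n) * entropy(fn, tn)
    return accuracy_rate, coverage_rate, h_before - h_after


# Toy data, invented purely for illustration (not from the benchmark set).
labels = [1, 1, 1, 0, 0, 0, 1, 0]
hits = [1, 1, 0, 0, 0, 0, 1, 1]
accuracy_rate, coverage_rate, info_gain = alert_indices(labels, hits)
print(accuracy_rate, coverage_rate, round(info_gain, 4))
```

Under these assumed definitions, an alert that matches many compounds can score well on coverage while still adding little information if it matches mutagens and non-mutagens in similar proportions, which is why the three indices are worth reporting together.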
Pages: 1355-1364
Number of pages: 10