An algorithm to identify functional groups in organic molecules

被引:79
作者
Ertl, Peter [1 ]
机构
[1] Novartis Inst BioMed Res, CH-4056 Basel, Switzerland
来源
JOURNAL OF CHEMINFORMATICS | 2017年 / 9卷
关键词
Functional group; Chemical functionality; Organic chemistry; Medicinal chemistry; IDENTIFICATION; CLASSIFICATION;
D O I
10.1186/s13321-017-0225-z
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Background: The concept of functional groups forms a basis of organic chemistry, medicinal chemistry, toxicity assessment, spectroscopy and also chemical nomenclature. All current software systems to identify functional groups are based on a predefined list of substructures. We are not aware of any program that can identify all functional groups in a molecule automatically. The algorithm presented in this article is an attempt to solve this scientific challenge. Results: An algorithm to identify functional groups in a molecule based on iterative marching through its atoms is described. The procedure is illustrated by extracting functional groups from the bioactive portion of the ChEMBL database, resulting in identification of 3080 unique functional groups. Conclusions: A new algorithm to identify all functional groups in organic molecules is presented. The algorithm is relatively simple and full details with examples are provided, therefore implementation in any cheminformatics toolkit should be relatively easy. The new method allows the analysis of functional groups in large chemical databases in a way that was not possible using previous approaches.
引用
收藏
页数:7
相关论文
共 13 条
[1]   New Substructure Filters for Removal of Pan Assay Interference Compounds (PAINS) from Screening Libraries and for Their Exclusion in Bioassays [J].
Baell, Jonathan B. ;
Holloway, Georgina A. .
JOURNAL OF MEDICINAL CHEMISTRY, 2010, 53 (07) :2719-2740
[2]   Automated compound classification using a chemical ontology [J].
Bobach, Claudia ;
Boehme, Timo ;
Laube, Ulf ;
Pueschel, Anett ;
Weber, Lutz .
JOURNAL OF CHEMINFORMATICS, 2012, 4
[3]   Rules for Identifying Potentially Reactive or Promiscuous Compounds [J].
Bruns, Robert F. ;
Watson, Ian A. .
JOURNAL OF MEDICINAL CHEMISTRY, 2012, 55 (22) :9763-9772
[4]   Cheminformatics analysis of organic substituents: identification of the most common substituents, calculation of substituent properties, and automatic identification of drug-like bioisosteric groups [J].
Ertl, P .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2003, 43 (02) :374-380
[5]   Quest for the rings.: In silico exploration of ring universe to identify novel bioactive heteroaromatic scaffolds [J].
Ertl, Peter ;
Jelfs, Stephen ;
Muehlbacher, Joerg ;
Schuffenhauer, Ansgar ;
Selzer, Paul .
JOURNAL OF MEDICINAL CHEMISTRY, 2006, 49 (15) :4568-4573
[6]   The Molecule Cloud - compact visualization of large collections of molecules [J].
Ertl, Peter ;
Rohde, Bernhard .
JOURNAL OF CHEMINFORMATICS, 2012, 4
[7]   CO: A chemical ontology for identification of functional groups and semantic comparison of small molecules [J].
Feldman, HJ ;
Dumontier, M ;
Ling, S ;
Haider, N ;
Hogue, CWV .
FEBS LETTERS, 2005, 579 (21) :4685-4691
[8]   ClassyFire: automated chemical classification with a comprehensive, computable taxonomy [J].
Feunang, Yannick Djoumbou ;
Eisner, Roman ;
Knox, Craig ;
Chepelev, Leonid ;
Hastings, Janna ;
Owen, Gareth ;
Fahy, Eoin ;
Steinbeck, Christoph ;
Subramanian, Shankar ;
Bolton, Evan ;
Greiner, Russell ;
Wishart, David S. .
JOURNAL OF CHEMINFORMATICS, 2016, 8 :1-20
[9]   ChEMBL: a large-scale bioactivity database for drug discovery [J].
Gaulton, Anna ;
Bellis, Louisa J. ;
Bento, A. Patricia ;
Chambers, Jon ;
Davies, Mark ;
Hersey, Anne ;
Light, Yvonne ;
McGlinchey, Shaun ;
Michalovich, David ;
Al-Lazikani, Bissan ;
Overington, John P. .
NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) :D1100-D1107
[10]   Modern 2D QSAR for drug discovery [J].
Lewis, Richard A. ;
Wood, David .
WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE, 2014, 4 (06) :505-522