Investigation of similarity and diversity threshold networks generated from diversity-oriented and focused chemical libraries

被引:0
作者
Ganesh Prabhu
Sudeepto Bhattacharya
Michael P. Krein
N. Sukumar
机构
[1] Shiv Nadar University,Department of Chemistry
[2] Shiv Nadar University,Department of Mathematics and Center for Informatics
[3] Lockheed-Martin Advanced Technology Laboratories,Department of Chemistry and Center for Informatics
[4] Shiv Nadar University,undefined
来源
Journal of Mathematical Chemistry | 2016年 / 54卷
关键词
Dissimilarity; Similarity; Diversity; Small-world; Chemical space networks;
D O I
暂无
中图分类号
学科分类号
摘要
Topological properties of chemical library networks, such as the average clustering coefficient, average path length, and existence of hubs, can serve as indicators to describe the inherent complexities of chemical libraries. We have used Diversity-Oriented Synthesis (DOS) and Focussed Libraries to investigate the appearance of scale-free properties and absence of small-world behavior in chemical libraries. DOS aims to elicit structural complexity in small compounds with respect to skeleton, functional groups, appendages and stereochemistry. Complexity here indicates incorporation of sp3\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hbox {sp}^{3}$$\end{document} carbons, hydrogen bond acceptors and donors in the molecule. Biological studies have shown how structural complexity enhances the interaction of molecules with complex biological macromolecules. In contrast, Focussed Libraries concentrate on specific scaffolds against a specific biological target. We have quantified the diversity in several DOS and Focussed Libraries based on properties of similarity and dissimilarity threshold networks formed from them. Similarity and dissimilarity networks were generated from diverse chemical libraries at various Tanimoto similarity coefficients (tc)\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hbox {t}_{\mathrm{c}})$$\end{document} using FP2 and MACCS fingerprints. The dissimilarity networks at very low tc\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hbox {t}_{\mathrm{c}}$$\end{document} threshold led to the absence of small-world behaviors, as evidenced by low average clustering coefficient and high average path length in comparison to Erdös–Renyi networks. Dissimilarity networks exhibit scale free topology as evidenced by a power law degree distribution. The similarity networks at high tc\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hbox {t}_{\mathrm{c}}$$\end{document} threshold have shown high clustering coefficients and low average path lengths, without the appearance of hubs. Combining dissimilarity and similarity threshold graphs revealed assortative and dissortative behaviors in the DOS libraries, leading to the conclusion that the vertices of the dissimilarity communities are more likely to share similarity edges, but it is quite unlikely for the vertices in a similarity community to share dissimilarity edges. We propose a simple and convenient diversity quantification tool, QuaLDI (Quantitative Library Diversity Index) to quantify the diversity in DOS and Focussed libraries. We anticipate that these topological properties can be used as descriptors to quantify the diversity in chemical libraries before proceeding for synthesis.
引用
收藏
页码:1916 / 1941
页数:25
相关论文
共 107 条
[1]  
Raevsky O(2004)undefined Mini-Rev. Med. Chem. 4 1041-5007
[2]  
Newman MEJ(2004)undefined Phys. Rev. E 69 066133-undefined
[3]  
Hert J(2004)undefined J. Chem. Inf. Comput. Sci. 44 1177-undefined
[4]  
Willett P(2011)undefined J. Cheminf. 3 33-undefined
[5]  
Wilton DJ(2010)undefined J. Chem. Inf. Model. 50 742-undefined
[6]  
O’Boyle NM(2011)undefined J. Phys. Chem. A 115 12905-undefined
[7]  
Banck M(2008)undefined J. Chem. Inf. Model. 48 1138-undefined
[8]  
James CA(1998)undefined Nature 393 440-undefined
[9]  
Morley C(1999)undefined Science 286 509-undefined
[10]  
Vandermeersch T(2002)undefined IEEE Infocom. 2 608-undefined