Three-Dimensional Biologically Relevant Spectrum (BRS-3D): Shape Similarity Profile Based on PDB Ligands as Molecular Descriptors

被引:16
作者
Hu, Ben [1 ,2 ]
Kuang, Zheng-Kun [1 ,2 ]
Feng, Shi-Yu [1 ]
Wang, Dong [1 ]
He, Song-Bing [1 ]
Kong, De-Xin [1 ,2 ]
机构
[1] Huazhong Agr Univ, State Key Lab Agr Microbiol, Wuhan 430070, Peoples R China
[2] Huazhong Agr Univ, Coll Informat, Agr Bioinformat Key Lab Hubei Prov, Wuhan 430070, Peoples R China
基金
中国国家自然科学基金;
关键词
BRS-3D; molecular similarity profile; QSAR; SVM; ligand-based virtual screening; subtype selectivity; SUPPORT VECTOR MACHINE; PREDICTING SUBTYPE SELECTIVITY; DRUG DISCOVERY; SCOP DATABASE; MEDICINAL CHEMISTRY; RECEPTOR LIGANDS; CHEMICAL SPACE; DOCKING; BINDING; PHARMACOLOGY;
D O I
10.3390/molecules21111554
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The crystallized ligands in the Protein Data Bank (PDB) can be treated as the inverse shapes of the active sites of corresponding proteins. Therefore, the shape similarity between a molecule and PDB ligands indicated the possibility of the molecule to bind with the targets. In this paper, we proposed a shape similarity profile that can be used as a molecular descriptor for ligand-based virtual screening. First, through three-dimensional (3D) structural clustering, 300 diverse ligands were extracted from the druggable protein-ligand database, sc-PDB. Then, each of the molecules under scrutiny was flexibly superimposed onto the 300 ligands. Superimpositions were scored by shape overlap and property similarity, producing a 300 dimensional similarity array termed the "Three-Dimensional Biologically Relevant Spectrum (BRS-3D)". Finally, quantitative or discriminant models were developed with the 300 dimensional descriptor using machine learning methods (support vector machine). The effectiveness of this approach was evaluated using 42 benchmark data sets from the G protein-coupled receptor (GPCR) ligand library and the GPCR decoy database (GLL/GDD). We compared the performance of BRS-3D with other 2D and 3D state-of-the-art molecular descriptors. The results showed that models built with BRS-3D performed best for most GLL/GDD data sets. We also applied BRS-3D in histone deacetylase 1 inhibitors screening and GPCR subtype selectivity prediction. The advantages and disadvantages of this approach are discussed.
引用
收藏
页数:20
相关论文
共 75 条
[1]   Beware of R2: Simple, Unambiguous Assessment of the Prediction Accuracy of QSAR and QSPR Models [J].
Alexander, D. L. J. ;
Tropsha, A. ;
Winkler, David A. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2015, 55 (07) :1316-1322
[2]   SCOP database in 2004: refinements integrate structure and sequence family data [J].
Andreeva, A ;
Howorth, D ;
Brenner, SE ;
Hubbard, TJP ;
Chothia, C ;
Murzin, AG .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D226-D229
[3]   Data growth and its impact on the SCOP database: new developments [J].
Andreeva, Antonina ;
Howorth, Dave ;
Chandonia, John-Marc ;
Brenner, Steven E. ;
Hubbard, Tim J. P. ;
Chothia, Cyrus ;
Murzin, Alexey G. .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D419-D425
[4]   SCOP2 prototype: a new approach to protein structure mining [J].
Andreeva, Antonina ;
Howorth, Dave ;
Chothia, Cyrus ;
Kulesha, Eugene ;
Murzin, Alexey G. .
NUCLEIC ACIDS RESEARCH, 2014, 42 (D1) :D310-D314
[5]  
[Anonymous], 2011, PIP PIL VERS 8 5
[6]  
[Anonymous], 2006, DRAG WIND VERS 5 4
[7]  
[Anonymous], 2004, AV CHEM DIR ACD VERS
[8]  
[Anonymous], 2009, VERS 2008 REL 10
[9]   The Physiology, Signaling, and Pharmacology of Dopamine Receptors [J].
Beaulieu, Jean-Martin ;
Gainetdinov, Raul R. .
PHARMACOLOGICAL REVIEWS, 2011, 63 (01) :182-217
[10]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242