Prediction of potential drug targets based on simple sequence properties

被引:98
作者
Li, Qingliang
Lai, Luhua [1 ]
机构
[1] Peking Univ, Coll Chem & Mol Engn, State Key Lab Struct Chem Stable & Unstable Speci, Beijing Natl Lab Mol Sci, Beijing 100871, Peoples R China
[2] Peking Univ, Ctr Theoret Biol, Beijing 100871, Peoples R China
关键词
D O I
10.1186/1471-2105-8-353
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: During the past decades, research and development in drug discovery have attracted much attention and efforts. However, only 324 drug targets are known for clinical drugs up to now. Identifying potential drug targets is the first step in the process of modern drug discovery for developing novel therapeutic agents. Therefore, the identification and validation of new and effective drug targets are of great value for drug discovery in both academia and pharmaceutical industry. If a protein can be predicted in advance for its potential application as a drug target, the drug discovery process targeting this protein will be greatly speeded up. In the current study, based on the properties of known drug targets, we have developed a sequence-based drug target prediction method for fast identification of novel drug targets. Results: Based on simple physicochemical properties extracted from protein sequences of known drug targets, several support vector machine models have been constructed in this study. The best model can distinguish currently known drug targets from non drug targets at an accuracy of 84%. Using this model, potential protein drug targets of human origin from Swiss-Prot were predicted, some of which have already attracted much attention as potential drug targets in pharmaceutical research. Conclusion: We have developed a drug target prediction method based solely on protein sequence information without the knowledge of family/domain annotation, or the protein 3D structure. This method can be applied in novel drug target identification and validation, as well as genome scale drug target predictions.
引用
收藏
页数:11
相关论文
共 35 条
[1]  
An Jianghong, 2004, Genome Inform, V15, P31
[2]   The NOX family of ROS-generating NADPH oxidases: Physiology and pathophysiology [J].
Bedard, Karen ;
Krause, Karl-Heinz .
PHYSIOLOGICAL REVIEWS, 2007, 87 (01) :245-313
[3]   How many genomics targets can a portfolio afford? [J].
Betz, UAK .
DRUG DISCOVERY TODAY, 2005, 10 (15) :1057-1063
[4]   Target discovery and validation in the post-genomic era [J].
Butcher, SP .
NEUROCHEMICAL RESEARCH, 2003, 28 (02) :367-371
[5]   Comparison of support vector machine and artificial neural network systems for drug/nondrug classification [J].
Byvatov, E ;
Fechner, U ;
Sadowski, J ;
Schneider, G .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2003, 43 (06) :1882-1889
[6]   SVM-Prot: web-based support vector machine software for functional classification of a protein from its primary sequence [J].
Cai, CZ ;
Han, LY ;
Ji, ZL ;
Chen, X ;
Chen, YZ .
NUCLEIC ACIDS RESEARCH, 2003, 31 (13) :3692-3697
[7]  
Chan C. C., 2001, LIBSVM LIB SUPPORT V
[8]   TTD: Therapeutic Target Database [J].
Chen, X ;
Ji, ZL ;
Chen, YZ .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :412-415
[9]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[10]   Multi-class protein fold recognition using support vector machines and neural networks [J].
Ding, CHQ ;
Dubchak, I .
BIOINFORMATICS, 2001, 17 (04) :349-358