ProteDNA: a sequence-based predictor of sequence-specific DNA-binding residues in transcription factors

被引:37
作者
Chu, Wen-Yi [2 ]
Huang, Yu-Feng [2 ]
Huang, Chun-Chin [1 ]
Cheng, Yi-Sheng [3 ]
Huang, Chien-Kang [1 ]
Oyang, Yen-Jen [2 ,4 ,5 ]
机构
[1] Natl Taiwan Univ, Dept Engn Sci & Ocean Engn, Taipei 10764, Taiwan
[2] Natl Taiwan Univ, Dept Comp Sci & Informat Engn, Taipei 10764, Taiwan
[3] Natl Taiwan Univ, Dept Life Sci, Taipei 10764, Taiwan
[4] Natl Taiwan Univ, Grad Inst Biomed Elect & Bioinformat, Taipei 10764, Taiwan
[5] Natl Taiwan Univ, Ctr Syst Biol & Bioinformat, Taipei 10764, Taiwan
关键词
SITES; PROTEINS;
D O I
10.1093/nar/gkp449
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
This article presents the design of a sequence-based predictor named ProteDNA for identifying the sequence-specific binding residues in a transcription factor (TF). Concerning protein-DNA interactions, there are two types of binding mechanisms involved, namely sequence-specific binding and nonspecific binding. Sequence-specific bindings occur between protein sidechains and nucleotide bases and correspond to sequence-specific recognition of genes. Therefore, sequence-specific bindings are essential for correct gene regulation. In this respect, ProteDNA is distinctive since it has been designed to identify sequence-specific binding residues. In order to accommodate users with different application needs, ProteDNA has been designed to operate under two modes, namely, the high-precision mode and the balanced mode. According to the experiments reported in this article, under the high-precision mode, ProteDNA has been able to deliver precision of 82.3%, specificity of 99.3%, sensitivity of 49.8% and accuracy of 96.5%. Meanwhile, under the balanced mode, ProteDNA has been able to deliver precision of 60.8%, specificity of 97.6%, sensitivity of 60.7% and accuracy of 95.4%. ProteDNA is available at the following websites: http://protedna.csbb.ntu.edu.tw/ http:// protedna.csie.ntu.edu.tw/ http://bio222.esoe.ntu.edu.tw/ProteDNA/.
引用
收藏
页码:W396 / W401
页数:6
相关论文
共 19 条
[1]   PSSM-based prediction of DNA binding sites in proteins [J].
Ahmad, S ;
Sarai, A .
BMC BIOINFORMATICS, 2005, 6 (1)
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[4]  
Boyer R.F., 2005, CONCEPTS BIOCH, V3rd
[5]   HTHquery: a method for detecting DNA-binding proteins with a helix-turn-helix structural motif [J].
Ferrer-Costa, C ;
Shanahan, HP ;
Jones, S ;
Thornton, JM .
BIOINFORMATICS, 2005, 21 (18) :3679-3680
[6]   Pfam:: clans, web tools and services [J].
Finn, Robert D. ;
Mistry, Jaina ;
Schuster-Bockler, Benjamin ;
Griffiths-Jones, Sam ;
Hollich, Volker ;
Lassmann, Timo ;
Moxon, Simon ;
Marshall, Mhairi ;
Khanna, Ajay ;
Durbin, Richard ;
Eddy, Sean R. ;
Sonnhammer, Erik L. L. ;
Bateman, Alex .
NUCLEIC ACIDS RESEARCH, 2006, 34 :D247-D251
[7]   SSEP-Domain: protein domain prediction by alignment of secondary structure elements and profiles [J].
Gewehr, JE ;
Zimmer, R .
BIOINFORMATICS, 2006, 22 (02) :181-187
[8]   AMINO-ACID SUBSTITUTION MATRICES FROM PROTEIN BLOCKS [J].
HENIKOFF, S ;
HENIKOFF, JG .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1992, 89 (22) :10915-10919
[9]   DP-Bind: a Web server for sequence-based prediction of DNA-binding residues in DNA-binding proteins [J].
Hwang, Seungwoo ;
Gou, Zhenkun ;
Kuznetsov, Igor B. .
BIOINFORMATICS, 2007, 23 (05) :634-636
[10]   Protein secondary structure prediction based on position-specific scoring matrices [J].
Jones, DT .
JOURNAL OF MOLECULAR BIOLOGY, 1999, 292 (02) :195-202