BindN: a web-based tool for efficient prediction of DNA and RNA binding sites in amino acid sequences

被引:334
作者
Wang, Liangjiang [1 ]
Brown, Susan J. [1 ]
机构
[1] Kansas State Univ, Bioinformat Ctr, Div Biol, Manhattan, KS 66506 USA
关键词
D O I
10.1093/nar/gkl298
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
BindN (http://bioinformatics.ksu.edu/bindn/) takes an amino acid sequence as input and predicts potential DNA or RNA-binding residues with support vector machines (SVMs). Protein datasets with known DNA or RNA-binding residues were selected from the Protein Data Bank (PDB), and SVM models were constructed using data instances encoded with three sequence features, including the side chain pK(a) value, hydrophobicity index and molecular mass of an amino acid. The results suggest that DNA-binding residues can be predicted at 69.40% sensitivity and 70.47% specificity, while prediction of RNA-binding residues achieves 66.28% sensitivity and 69.84% specificity. When compared with previous studies, the SVM models appear to be more accurate and more efficient for online predictions. BindN provides a useful tool for understanding the function of DNA and RNA-binding proteins based on primary sequence data.
引用
收藏
页码:W243 / W248
页数:6
相关论文
共 19 条
  • [1] PSSM-based prediction of DNA binding sites in proteins
    Ahmad, S
    Sarai, A
    [J]. BMC BIOINFORMATICS, 2005, 6 (1)
  • [2] Analysis and prediction of DNA-binding proteins and their binding residues based on composition, sequence and structural information
    Ahmad, S
    Gromiha, MM
    Sarai, A
    [J]. BIOINFORMATICS, 2004, 20 (04) : 477 - 486
  • [3] ALTSCHUL SF, 1997, NUCLEIC ACIDS RES, V25, P3402
  • [4] [Anonymous], KERNEL METHODS COMPU
  • [5] The use of the area under the roc curve in the evaluation of machine learning algorithms
    Bradley, AP
    [J]. PATTERN RECOGNITION, 1997, 30 (07) : 1145 - 1159
  • [6] Themes in RNA-protein recognition
    Draper, DE
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1999, 293 (02) : 255 - 270
  • [7] RS domains contact the pre-mRNA throughout spliceosome assembly
    Hertel, KJ
    Graveley, BR
    [J]. TRENDS IN BIOCHEMICAL SCIENCES, 2005, 30 (03) : 115 - 118
  • [8] Joachims J., 1999, ADV KERNEL METHODS S
  • [9] Using electrostatic potentials to predict DNA-binding sites on DNA-binding proteins
    Jones, S
    Shanahan, HP
    Berman, HM
    Thornton, JM
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (24) : 7189 - 7198
  • [10] Protein-RNA interactions: a structural analysis
    Jones, S
    Daley, DTA
    Luscombe, NM
    Berman, HM
    Thornton, JM
    [J]. NUCLEIC ACIDS RESEARCH, 2001, 29 (04) : 943 - 954