Predicting DNA-binding sites of proteins based on sequential and 3D structural information

被引:18
|
作者
Li, Bi-Qing [1 ,2 ]
Feng, Kai-Yan [3 ]
Ding, Juan [4 ]
Cai, Yu-Dong [5 ]
机构
[1] Chinese Acad Sci, Key Lab Syst Biol, Shanghai Inst Biol Sci, Shanghai 200031, Peoples R China
[2] Shanghai Ctr Bioinformat Technol, Shanghai, Peoples R China
[3] Beijing Genom Inst, Shenzhen 518083, Peoples R China
[4] Harvard Univ, Sch Med, Schepens Eye Res Inst, Boston, MA 02114 USA
[5] Shanghai Univ, Inst Syst Biol, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Protein-DNA interactions; Structural features; Random Forest (RF); Maximum relevance minimum redundancy (mRMR); Incremental feature selection (IFS); INTRINSIC DISORDER; SMO ALGORITHM; EFFICIENT PREDICTION; ACCURATE PREDICTION; RESIDUES; RECOGNITION; RELEVANCE; INDEX; MRMR;
D O I
10.1007/s00438-014-0812-x
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Protein-DNA interactions play important roles in many biological processes. To understand the molecular mechanisms of protein-DNA interaction, it is necessary to identify the DNA-binding sites in DNA-binding proteins. In the last decade, computational approaches have been developed to predict protein-DNA-binding sites based solely on protein sequences. In this study, we developed a novel predictor based on support vector machine algorithm coupled with the maximum relevance minimum redundancy method followed by incremental feature selection. We incorporated not only features of physicochemical/biochemical properties, sequence conservation, residual disorder, secondary structure, solvent accessibility, but also five three-dimensional (3D) structural features calculated from PDB data to predict the protein-DNA interaction sites. Feature analysis showed that 3D structural features indeed contributed to the prediction of DNA-binding site and it was demonstrated that the prediction performance was better with 3D structural features than without them. It was also shown via analysis of features from each site that the features of DNA-binding site itself contribute the most to the prediction. Our prediction method may become a useful tool for identifying the DNA-binding sites and the feature analysis described in this paper may provide useful insights for in-depth investigations into the mechanisms of protein-DNA interaction.
引用
收藏
页码:489 / 499
页数:11
相关论文
共 50 条
  • [1] Predicting DNA-binding sites of proteins based on sequential and 3D structural information
    Bi-Qing Li
    Kai-Yan Feng
    Juan Ding
    Yu-Dong Cai
    Molecular Genetics and Genomics, 2014, 289 : 489 - 499
  • [2] Using evolutionary and structural information to predict DNA-binding sites on DNA-binding proteins
    Kuznetsov, Igor B.
    Gou, Zhenkun
    Li, Run
    Hwang, Seungwoo
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2006, 64 (01) : 19 - 27
  • [3] Sequence-based prediction of DNA-binding sites on DNA-binding proteins
    Gou, Z.
    Hwang, S.
    Kuznetsov, B., I
    PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE, VOL 1, 2006, : 268 - +
  • [4] Set of approaches based on 3D structure and position specific-scoring matrix for predicting DNA-binding proteins
    Nanni, Loris
    Brahnam, Sheryl
    BIOINFORMATICS, 2019, 35 (11) : 1844 - 1851
  • [5] Predicting DNA-binding sites of proteins from amino acid sequence
    Changhui Yan
    Michael Terribilini
    Feihong Wu
    Robert L Jernigan
    Drena Dobbs
    Vasant Honavar
    BMC Bioinformatics, 7
  • [6] Predicting DNA-binding sites of proteins from amino acid sequence
    Yan, Changhui
    Terribilini, Michael
    Wu, Feihong
    Jernigan, Robert L.
    Dobbs, Drena
    Honavar, Vasant
    BMC BIOINFORMATICS, 2006, 7 (1)
  • [7] Analysis and prediction of DNA-binding proteins and their binding residues based on composition, sequence and structural information
    Ahmad, S
    Gromiha, MM
    Sarai, A
    BIOINFORMATICS, 2004, 20 (04) : 477 - 486
  • [8] Using electrostatic potentials to predict DNA-binding sites on DNA-binding proteins
    Jones, S
    Shanahan, HP
    Berman, HM
    Thornton, JM
    NUCLEIC ACIDS RESEARCH, 2003, 31 (24) : 7189 - 7198
  • [9] SELECTION OF DNA-BINDING SITES BY REGULATORY PROTEINS
    BERG, OG
    VONHIPPEL, PH
    TRENDS IN BIOCHEMICAL SCIENCES, 1988, 13 (06) : 207 - 211
  • [10] Predicting Target DNA Sequences of DNA-Binding Proteins Based on Unbound Structures
    Chen, Chien-Yu
    Chien, Ting-Ying
    Lin, Chih-Kang
    Lin, Chih-Wei
    Weng, Yi-Zhong
    Chang, Darby Tien-Hao
    PLOS ONE, 2012, 7 (02):