Predicting DNA-binding sites of proteins based on sequential and 3D structural information

被引:18
|
作者
Li, Bi-Qing [1 ,2 ]
Feng, Kai-Yan [3 ]
Ding, Juan [4 ]
Cai, Yu-Dong [5 ]
机构
[1] Chinese Acad Sci, Key Lab Syst Biol, Shanghai Inst Biol Sci, Shanghai 200031, Peoples R China
[2] Shanghai Ctr Bioinformat Technol, Shanghai, Peoples R China
[3] Beijing Genom Inst, Shenzhen 518083, Peoples R China
[4] Harvard Univ, Sch Med, Schepens Eye Res Inst, Boston, MA 02114 USA
[5] Shanghai Univ, Inst Syst Biol, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Protein-DNA interactions; Structural features; Random Forest (RF); Maximum relevance minimum redundancy (mRMR); Incremental feature selection (IFS); INTRINSIC DISORDER; SMO ALGORITHM; EFFICIENT PREDICTION; ACCURATE PREDICTION; RESIDUES; RECOGNITION; RELEVANCE; INDEX; MRMR;
D O I
10.1007/s00438-014-0812-x
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Protein-DNA interactions play important roles in many biological processes. To understand the molecular mechanisms of protein-DNA interaction, it is necessary to identify the DNA-binding sites in DNA-binding proteins. In the last decade, computational approaches have been developed to predict protein-DNA-binding sites based solely on protein sequences. In this study, we developed a novel predictor based on support vector machine algorithm coupled with the maximum relevance minimum redundancy method followed by incremental feature selection. We incorporated not only features of physicochemical/biochemical properties, sequence conservation, residual disorder, secondary structure, solvent accessibility, but also five three-dimensional (3D) structural features calculated from PDB data to predict the protein-DNA interaction sites. Feature analysis showed that 3D structural features indeed contributed to the prediction of DNA-binding site and it was demonstrated that the prediction performance was better with 3D structural features than without them. It was also shown via analysis of features from each site that the features of DNA-binding site itself contribute the most to the prediction. Our prediction method may become a useful tool for identifying the DNA-binding sites and the feature analysis described in this paper may provide useful insights for in-depth investigations into the mechanisms of protein-DNA interaction.
引用
收藏
页码:489 / 499
页数:11
相关论文
共 50 条
  • [31] Structure based prediction of binding residues on DNA-binding proteins
    Bhardwaj, Nitin
    Langlois, Robert E.
    Hui, Guijun Zhao
    2005 27TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2005, : 2611 - 2614
  • [32] Inadequacy of Evolutionary Profiles Vis-a-vis Single Sequences in Predicting Transient DNA-Binding Sites in Proteins
    Arya, Ajay
    Varghese, Dana Mary
    Verma, Ajay Kumar
    Ahmad, Shandar
    JOURNAL OF MOLECULAR BIOLOGY, 2022, 434 (13)
  • [33] DISCRIMINATION OF DNA-BINDING SITES BY MUTANT P53 PROTEINS
    THUKRAL, SK
    LU, Y
    BLAIN, GC
    HARVEY, TS
    JACOBSEN, VL
    MOLECULAR AND CELLULAR BIOLOGY, 1995, 15 (09) : 5196 - 5202
  • [34] How DNA-Binding Proteins find their Target Sites in Human Cells
    Normanno, Davide
    Boudarene, Lydia
    Dugast-Darzacq, Claire
    Darzacq, Xavier
    Dahan, Maxime
    BIOPHYSICAL JOURNAL, 2013, 104 (02) : 541A - 542A
  • [35] Sequence-Based Prediction of DNA-Binding Residues in Proteins with Conservation and Correlation Information
    Ma, Xin
    Guo, Jing
    Liu, Hong-De
    Xie, Jian-Ming
    Sun, Xiao
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2012, 9 (06) : 1766 - 1775
  • [36] SEARCHING FOR AND PREDICTING THE ACTIVITY OF SITES FOR DNA-BINDING PROTEINS - COMPILATION AND ANALYSIS OF THE BINDING-SITES FOR ESCHERICHIA-COLI INTEGRATION HOST FACTOR (IHF)
    GOODRICH, JA
    SCHWARTZ, ML
    MCCLURE, WR
    NUCLEIC ACIDS RESEARCH, 1990, 18 (17) : 4993 - 5000
  • [37] 3D similarities between the binding sites of monoaminergic target proteins
    Nunez-Vivanco, Gabriel
    Fierro, Angelica
    Moya, Pablo
    Iturriaga-Vasquez, Patricio
    Reyes-Parada, Miguel
    PLOS ONE, 2018, 13 (07):
  • [38] An ensemble of reduced alphabets with protein encoding based on grouped weight for predicting DNA-binding proteins
    Loris Nanni
    Alessandra Lumini
    Amino Acids, 2009, 36 : 167 - 175
  • [39] An ensemble of reduced alphabets with protein encoding based on grouped weight for predicting DNA-binding proteins
    Nanni, Loris
    Lumini, Alessandra
    AMINO ACIDS, 2009, 36 (02) : 167 - 175
  • [40] Structural Mechanisms of Cooperative DNA Binding by Bacterial Single-Stranded DNA-Binding Proteins
    Dubiel, Katarzyna
    Myers, Angela R.
    Kozlov, Alexander G.
    Yang, Olivia
    Zhang, Jichuan
    Ha, Taekjip
    Lohman, Timothy M.
    Keck, James L.
    JOURNAL OF MOLECULAR BIOLOGY, 2019, 431 (02) : 178 - 195