Use of structural DNA properties for the prediction of transcription-factor binding sites in Escherichia coli

被引:33
作者
Meysman, Pieter [1 ]
Thanh Hai Dang [2 ]
Laukens, Kris [2 ]
De Smet, Riet [1 ]
Wu, Yan [1 ]
Marchal, Kathleen [1 ]
Engelen, Kristof [1 ]
机构
[1] Katholieke Univ Leuven, Dept Microbial & Mol Syst, B-3001 Leuven, Belgium
[2] Dept Math & Comp Sci, Intelligent Syst Lab, B-2020 Antwerp, Belgium
关键词
MOLECULAR-DYNAMICS SIMULATIONS; CORE PROMOTER; B-DNA; SEQUENCE; PROTEIN; GENE; IDENTIFICATION; RECOGNITION; PARAMETERS; STABILITY;
D O I
10.1093/nar/gkq1071
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Recognition of genomic binding sites by transcription factors can occur through base-specific recognition, or by recognition of variations within the structure of the DNA macromolecule. In this article, we investigate what information can be retrieved from local DNA structural properties that is relevant to transcription factor binding and that cannot be captured by the nucleotide sequence alone. More specifically, we explore the benefit of employing the structural characteristics of DNA to create binding-site models that encompass indirect recognition for the Escherichia coli model organism. We developed a novel methodology [Conditional Random fields of Smoothed Structural Data (CRoSSeD)], based on structural scales and conditional random fields to model and predict regulator binding sites. The value of relying on local structural-DNA properties is demonstrated by improved classifier performance on a large number of biological datasets, and by the detection of novel binding sites which could be validated by independent data sources, and which could not be identified using sequence data alone. We further show that the CRoSSeD-binding-site models can be related to the actual molecular mechanisms of the transcription factor DNA binding, and thus cannot only be used for prediction of novel sites, but might also give valuable insights into unknown binding mechanisms of transcription factors.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] The strength of transcription-factor binding modulates co-variation in transcriptional networks
    Nuzhdin, Sergey V.
    Rychkova, Anna
    Hahn, Matthew W.
    TRENDS IN GENETICS, 2010, 26 (02) : 51 - 53
  • [32] Prediction of Transcription Factor Binding Sites With an Attention Augmented Convolutional Neural Network
    Jing, Fang
    Zhang, Shao-Wu
    Zhang, Shihua
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (06) : 3614 - 3623
  • [33] A Review About Transcription Factor Binding Sites Prediction Based on Deep Learning
    Zeng, Yuanqi
    Gong, Meiqin
    Lin, Meng
    Gao, Dongrui
    Zhang, Yongqing
    IEEE ACCESS, 2020, 8 : 219256 - 219274
  • [34] An all-atom knowledge-based energy function for protein-DNA threading, docking decoy discrimination, and prediction of transcription-factor binding profiles
    Xu, Beisi
    Yang, Yuedong
    Liang, Haojun
    Zhou, Yaoqi
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2009, 76 (03) : 718 - 730
  • [35] TFBSshape: a motif database for DNA shape features of transcription factor binding sites
    Yang, Lin
    Zhou, Tianyin
    Dror, Iris
    Mathelier, Anthony
    Wasserman, Wyeth W.
    Gordan, Raluca
    Rohs, Remo
    NUCLEIC ACIDS RESEARCH, 2014, 42 (D1) : D148 - D155
  • [36] COMPUTER-AIDED ANALYSIS OF POTENTIAL TRANSCRIPTION-FACTOR BINDING-SITES IN THE RABBIT BETA-CASEIN GENE PROMOTER
    MALEWSKI, T
    ZWIERZCHOWSKI, L
    BIOSYSTEMS, 1995, 36 (02) : 109 - 119
  • [37] A comparison study on feature selection of DNA structural properties for promoter prediction
    Gan, Yanglan
    Guan, Jihong
    Zhou, Shuigeng
    BMC BIOINFORMATICS, 2012, 13
  • [38] Inhibition of transcription factor assembly and structural stability on mitoxantrone binding with DNA
    Khan, Shahper N.
    Danishuddin, Mohd
    Khan, Asad U.
    BIOSCIENCE REPORTS, 2010, 30 (05) : 331 - 340
  • [39] Computational prediction of transcription factor binding sites based on an integrative approach incorporating genomic and epigenomic features
    Seok, Ho-Sik
    Kim, Jaebum
    GENES & GENOMICS, 2014, 36 (01) : 25 - 30
  • [40] Predicting bacterial transcription factor binding sites through machine learning and structural characterization based on DNA duplex stability
    Borges Farias, Andre
    Martinez, Gustavo Sganzerla
    Galan-Vasquez, Edgardo
    Nicolas, Marisa Fabiana
    Perez-Rueda, Ernesto
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (06)