Modeling the cis-regulatory modules of genes expressed in developmental stages of Drosophila melanogaster

被引:1
作者
Lopez, Yosvany [1 ,2 ]
Vandenbon, Alexis [3 ]
Nose, Akinao [4 ]
Nakai, Kenta [1 ]
机构
[1] Univ Tokyo, Human Genome Ctr, Inst Med Sci, Tokyo, Japan
[2] Univ Tokyo, Grad Sch Frontier Sci, Dept Computat Biol, Chiba, Japan
[3] Osaka Univ, Immunol Frontier Res Ctr, Osaka, Japan
[4] Univ Tokyo, Grad Sch Frontier Sci, Dept Complex Sci & Engn, Chiba, Japan
来源
PEERJ | 2017年 / 5卷
关键词
Promoter architecture; Co-expression; Genetic algorithm; Transcription factor binding sites; Developmental stage; Genome-wide analysis; FACTOR-BINDING SITES; GENOME-WIDE ANALYSIS; CHIP-SEQ; PROTEIN; DORSAL; IDENTIFICATION; ELEMENTS; SPECIFICITY; DATABASE; REGIONS;
D O I
10.7717/peerj.3389
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Because transcription is the first step in the regulation of gene expression, understanding how transcription factors bind to their DNA binding motifs has become absolutely necessary. It has been shown that the promoters of genes with similar expression profiles share common structural patterns. This paper presents an extensive study of the regulatory regions of genes expressed in 24 developmental stages of Drosophila melanogaster. It proposes the use of a combination of structural features, such as positioning of individual motifs relative to the transcription start site, orientation, pairwise distance between motifs, and presence of motifs anywhere in the promoter for predicting gene expression from structural features of promoter sequences. RNA-sequencing data was utilized to create and validate the 24 models. When genes with high-scoring promoters were compared to those identified by RNA-seq samples, 19 (79.2%) statistically significant models, a number that exceeds previous studies, were obtained. Each model yielded a set of highly informative features, which were used to search for genes with similar biological functions.
引用
收藏
页数:24
相关论文
共 57 条
  • [1] [Anonymous], 11 IEEE INT C DAT MI
  • [2] [Anonymous], 2009, FASTX TOOLKIT
  • [3] The Drosophila zinc finger transcription factor CF2 is a myogenic marker downstream of MEF2 during muscle development
    Bagni, C
    Bray, S
    Gogos, JA
    Kafatos, FC
    Hsu, T
    [J]. MECHANISMS OF DEVELOPMENT, 2002, 117 (1-2) : 265 - 268
  • [4] MEME: discovering and analyzing DNA and protein sequence motifs
    Bailey, Timothy L.
    Williams, Nadya
    Misleh, Chris
    Li, Wilfred W.
    [J]. NUCLEIC ACIDS RESEARCH, 2006, 34 : W369 - W373
  • [5] Bajic VB, 2003, SILICO BIOL, V4, P1
  • [6] The legacy of Drosophila imaginal discs
    Beira, Jorge V.
    Paro, Renato
    [J]. CHROMOSOMA, 2016, 125 (04) : 573 - 592
  • [7] Campos-Ortega J. A., 2013, The embryonic development of Drosophila melanogaster, V2nd
  • [8] Identification of novel genes in Drosophila reveals the complex regulation of early gene activity in the mesoderm
    Casal, J
    Leptin, M
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1996, 93 (19) : 10327 - 10332
  • [9] Eukaryotic transcriptional dynamics: from single molecules to cell populations
    Coulon, Antoine
    Chow, Carson C.
    Singer, Robert H.
    Larson, Daniel R.
    [J]. NATURE REVIEWS GENETICS, 2013, 14 (08) : 572 - 584
  • [10] tailup, A LIM-HD gene, and Iro-C cooperate in Drosophila dorsal mesothorax specification
    de Navascues, Joaquin
    Modolell, Juan
    [J]. DEVELOPMENT, 2007, 134 (09): : 1779 - 1788