Identification of gene specific cis-regulatory elements during differentiation of mouse embryonic stem cells: An integrative approach using high-throughput datasets

Cited by: 14
Authors
Vijayabaskar, M. S. [1, 8]
Goode, Debbie K. [2, 3, 4]
Obier, Nadine [5 ]
Lichtinger, Monika [5 ]
Emmett, Amber M. L. [1 ]
Abidin, Fatin N. Zainul [1, 9]
Shar, Nisar [1, 10]
Hannah, Rebecca [2, 3, 4]
Assi, Salam A. [5 ]
Lie-A-Ling, Michael [6 ]
Gottgens, Berthold [2, 3, 4]
Lacaud, Georges [6 ]
Kouskoff, Valerie [7 ]
Bonifer, Constanze [5 ]
Westhead, David R. [1 ]
Affiliations
[1] Univ Leeds, Sch Mol & Cellular Biol, Fac Biol Sci, Leeds, W Yorkshire, England
[2] Univ Cambridge, Wellcome Trust, Cambridge, England
[3] Univ Cambridge, MRC Cambridge Stem Cell Inst, Cambridge, England
[4] Univ Cambridge, Cambridge Inst Med Res, Cambridge, England
[5] Univ Birmingham, Inst Canc & Genom Sci, Coll Med & Dent Sci, Birmingham, W Midlands, England
[6] Univ Manchester, CRUK Manchester Inst, Manchester, Lancs, England
[7] Univ Manchester, Div Dev Biol & Med, Manchester, Lancs, England
[8] Wellcome Sanger Inst, Hinxton, England
[9] Univ Kebangsaan Malaysia, Inst Syst Biol INBIOSIS, Bangi, Selangor DE, Malaysia
[10] NED Univ Engn & Technol, Dept Biomed Engn, Karachi, Pakistan
Funding
Wellcome Trust (UK); Medical Research Council (UK); Biotechnology and Biological Sciences Research Council (UK);
Keywords
POST-SELECTION INFERENCE; GENOME-WIDE ANALYSIS; TRANSCRIPTION FACTORS; HISTONE MODIFICATIONS; DNA ELEMENTS; ENHANCERS; EXPRESSION; NETWORKS; BINDING; GENERATION;
DOI
10.1371/journal.pcbi.1007337
Chinese Library Classification number
Q5 [Biochemistry];
Subject classification codes
071010 ; 081704 ;
Abstract
Gene expression governs cell fate and is regulated via a complex interplay of transcription factors and molecules that change chromatin structure. Advances in sequencing-based assays have enabled investigation of these processes genome-wide, leading to large datasets that combine information on the dynamics of gene expression, transcription factor binding and chromatin structure as cells differentiate. While numerous studies focus on the effects of these features on broader gene regulation, less work has been done on the mechanisms of gene-specific transcriptional control. In this study, we have focussed on the latter by integrating gene expression data for the in vitro differentiation of murine ES cells to macrophages and cardiomyocytes with dynamic data on chromatin structure, epigenetics and transcription factor binding. Combining a novel strategy to identify communities of related control elements with a penalized regression approach, we developed individual models to identify the potential control elements predictive of the expression of each gene. Our models were compared to an existing method and evaluated using the existing literature and new experimental data from embryonic stem cell differentiation reporter assays. Our method is able to identify transcriptional control elements in a gene-specific manner that reflect known regulatory relationships and to generate useful hypotheses for further testing.
Author summary
The inherited information in our DNA genomes is a code which defines both the functional units (proteins, nucleic acids etc.) and the patterns of their usage necessary to make life. The genome in mammals, such as man and mouse, has genes which code for about 20,000 different proteins, but the usage of these proteins differs in each type of cell within these complex multicellular organisms. How this differential usage is controlled is known as genetic regulation, and that is what we study here.
We know that the details lie in how genes are turned on and off, but until the advent of high-throughput sequencing technology a genome-wide study was nearly impossible. Further complicating our efforts to understand genetic regulation is the involvement of parts of the genome that were previously deemed junk. In this work, we have focussed on how genes are controlled at various developmental stages in mouse, by looking at sequencing data on different regulatory mechanisms such as protein binding and local changes to DNA packaging. On a gene-by-gene basis, we have built statistical models that predict how genes are controlled when cells develop. These predictions provide a focus for future experimental studies of genetic regulation.
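The per-gene penalized regression idea mentioned in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes a lasso (L1) penalty as one common form of penalized regression, uses synthetic data, and all names (`simulate_gene`, `lasso_coordinate_descent`, the choice of eight candidate elements) are hypothetical. Each row stands for one sample (for example a differentiation stage), each column for the activity of one candidate control element, and the target is the expression of a single gene.

```python
import random


def soft_threshold(value, penalty):
    """Shrink value toward zero by penalty (the lasso proximal operator)."""
    if value > penalty:
        return value - penalty
    if value < -penalty:
        return value + penalty
    return 0.0


def lasso_coordinate_descent(X, y, penalty, n_iter=200):
    """Minimise (1/2n)*||y - Xw||^2 + penalty*||w||_1 by cyclic coordinate descent.

    X is a list of n rows with p features each; y is a list of n targets.
    """
    n, p = len(X), len(X[0])
    w = [0.0] * p
    residual = list(y)  # equals y - Xw while w is all zeros
    col_sq = [sum(X[i][j] ** 2 for i in range(n)) / n for j in range(p)]
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue
            # Covariance of feature j with the residual that excludes feature j.
            rho = sum(X[i][j] * (residual[i] + X[i][j] * w[j]) for i in range(n)) / n
            new_w = soft_threshold(rho, penalty) / col_sq[j]
            delta = new_w - w[j]
            if delta != 0.0:
                for i in range(n):
                    residual[i] -= X[i][j] * delta
                w[j] = new_w
    return w


def simulate_gene(n_samples=40, n_elements=8, seed=0):
    """Synthetic data (assumption): only elements 0 and 3 drive expression."""
    rng = random.Random(seed)
    X = [[rng.gauss(0, 1) for _ in range(n_elements)] for _ in range(n_samples)]
    true_w = [2.0, 0.0, 0.0, -1.5] + [0.0] * (n_elements - 4)
    y = [sum(w * x for w, x in zip(true_w, row)) + rng.gauss(0, 0.1) for row in X]
    return X, y


X, y = simulate_gene()
weights = lasso_coordinate_descent(X, y, penalty=0.2)
# Elements with a non-zero weight are the ones the model keeps for this gene.
selected = [j for j, w in enumerate(weights) if abs(w) > 1e-8]
print("selected candidate elements:", selected)
```

In a setting like the one described, a separate model of this kind would be fitted for each gene, and the elements with non-zero weights would be read as candidate regulators to test further. The penalty value here is arbitrary; in practice it would normally be chosen by cross-validation.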
Pages: 29