De novo prediction of human chromosome structures: Epigenetic marking patterns encode genome architecture

被引:156
作者
Di Pierro, Michele [1 ]
Cheng, Ryan R. [1 ]
Aiden, Erez Lieberman [1 ,2 ]
Wolynes, Peter G. [1 ,3 ,4 ]
Onuchic, Jose N. [1 ,4 ]
机构
[1] Rice Univ, Ctr Theoret Biol Phys, Houston, TX 77005 USA
[2] Baylor Coll Med, Ctr Genome Architecture, Houston, TX 77030 USA
[3] Rice Univ, Dept Chem, Houston, TX 77005 USA
[4] Rice Univ, Dept Phys & Astron, Houston, TX 77005 USA
基金
美国国家科学基金会;
关键词
epigenetics; machine learning; energy landscape theory; genomic architecture; Hi-C; TOPOLOGICAL DOMAINS; PHASE-SEPARATION; 3D GENOME; SPATIAL-ORGANIZATION; ENERGY LANDSCAPES; CHROMATIN DOMAINS; MODEL; PRINCIPLES; LOOPS; HETEROCHROMATIN;
D O I
10.1073/pnas.1714980114
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Inside the cell nucleus, genomes fold into organized structures that are characteristic of cell type. Here, we show that this chromatin architecture can be predicted de novo using epigenetic data derived from chromatin immunoprecipitation-sequencing (ChIP-Seq). We exploit the idea that chromosomes encode a 1D sequence of chromatin structural types. Interactions between these chromatin types determine the 3D structural ensemble of chromosomes through a process similar to phase separation. First, a neural network is used to infer the relation between the epigenetic marks present at a locus, as assayed by ChIP-Seq, and the genomic compartment in which those loci reside, as measured by DNA-DNA proximity ligation (Hi-C). Next, types inferred fromthis neural network are used as an input to an energy landscape model for chromatin organization [Minimal Chromatin Model (MiChroM)] to generate an ensemble of 3D chromosome conformations at a resolution of 50 kilobases (kb). After training the model, dubbed Maximum Entropy Genomic Annotation from Biomarkers Associated to Structural Ensembles (MEGABASE), on odd-numbered chromosomes, we predict the sequences of chromatin types and the subsequent 3D conformational ensembles for the even chromosomes. We validate these structural ensembles by using ChIP-Seq tracks alone to predict Hi-C maps, as well as distances measured using 3D fluorescence in situ hybridization (FISH) experiments. Both sets of experiments support the hypothesis of phase separation being the driving process behind compart-mentalization. These findings strongly suggest that epigenetic marking patterns encode sufficient information to determine the global architecture of chromosomes and that de novo structure prediction for whole genomes may be increasingly possible.
引用
收藏
页码:12126 / 12131
页数:6
相关论文
共 37 条
  • [1] [Anonymous], 2002, LAUR024481 LANL
  • [2] Complexity of chromatin folding is captured by the strings and binders switch model
    Barbieri, Mariano
    Chotalia, Mita
    Fraser, James
    Lavitas, Liron-Mark
    Dostie, Josee
    Pombo, Ana
    Nicodemi, Mario
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2012, 109 (40) : 16173 - 16178
  • [3] The Spatial Organization of the Human Genome
    Bickmore, Wendy A.
    [J]. ANNUAL REVIEW OF GENOMICS AND HUMAN GENETICS, VOL 14, 2013, 14 : 67 - 84
  • [4] Simulated binding of transcription factors to active and inactive regions folds human chromosomes into loops, rosettes and topological domains
    Brackley, Chris A.
    Johnson, James
    Kelly, Steven
    Cook, Peter R.
    Marenduzzo, Davide
    [J]. NUCLEIC ACIDS RESEARCH, 2016, 44 (08) : 3503 - 3512
  • [5] Bryngelson JD, 1990, TETRAHEDRON COMPUT M, V3, P129
  • [6] Chromosome territories, nuclear architecture and gene regulation in mammalian cells
    Cremer, T
    Cremer, C
    [J]. NATURE REVIEWS GENETICS, 2001, 2 (04) : 292 - 301
  • [7] Transferable model for chromosome architecture
    Di Pierro, Michele
    Zhang, Bin
    Aiden, Erez Lieberman
    Wolynes, Peter G.
    Onuchic, Jose N.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2016, 113 (43) : 12168 - 12173
  • [8] Topological domains in mammalian genomes identified by analysis of chromatin interactions
    Dixon, Jesse R.
    Selvaraj, Siddarth
    Yue, Feng
    Kim, Audrey
    Li, Yan
    Shen, Yin
    Hu, Ming
    Liu, Jun S.
    Ren, Bing
    [J]. NATURE, 2012, 485 (7398) : 376 - 380
  • [9] Stable Chromosome Condensation Revealed by Chromosome Conformation Capture
    Eagen, Kyle P.
    Hartl, Tom A.
    Kornberg, Roger D.
    [J]. CELL, 2015, 163 (04) : 934 - 946
  • [10] Improved contact prediction in proteins: Using pseudolikelihoods to infer Potts models
    Ekeberg, Magnus
    Lovkvist, Cecilia
    Lan, Yueheng
    Weigt, Martin
    Aurell, Erik
    [J]. PHYSICAL REVIEW E, 2013, 87 (01)