Discovering hotspots in functional genomic data superposed on 3D chromatin configuration reconstructions

被引:18
作者
Capurso, Daniel [1 ]
Bengtsson, Henrik [2 ]
Segal, Mark R. [2 ]
机构
[1] Univ Calif San Francisco, Dept Bioengn & Therapeut Sci, San Francisco, CA 94158 USA
[2] Univ Calif San Francisco, Dept Epidemiol & Biostat, San Francisco, CA 94158 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
HI-C DATA; SACCHAROMYCES-CEREVISIAE; CHIP-SEQ; NUCLEAR-ORGANIZATION; BINDING-SITES; BUDDING YEAST; SINGLE-CELL; REVEALS; GENE; TRANSCRIPTION;
D O I
10.1093/nar/gkw070
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The spatial organization of the genome influences cellular function, notably gene regulation. Recent studies have assessed the three-dimensional (3D) co-localization of functional annotations (e.g. centromeres, long terminal repeats) using 3D genome reconstructions from Hi-C (genome-wide chromosome conformation capture) data; however, corresponding assessments for continuous functional genomic data (e.g. chromatin immunoprecipitation-sequencing (ChIP-seq) peak height) are lacking. Here, we demonstrate that applying bump hunting via the patient rule induction method (PRIM) to ChIP-seq data superposed on a Saccharomyces cerevisiae 3D genome reconstruction can discover 'functional 3D hotspots', regions in 3-space for which the mean ChIP-seq peak height is significantly elevated. For the transcription factor Swi6, the top hotspot by P-value contains MSB2 and ERG11 - known Swi6 target genes on different chromosomes. We verify this finding in a number of ways. First, this top hotspot is relatively stable under PRIM across parameter settings. Second, this hotspot is among the top hotspots by mean outcome identified by an alternative algorithm, k-Nearest Neighbor (k-NN) regression. Third, the distance between MSB2 and ERG11 is smaller than expected (by resampling) in two other 3D reconstructions generated via different normalization and reconstruction algorithms. This analytic approach can discover functional 3D hotspots and potentially reveal novel regulatory interactions.
引用
收藏
页码:2028 / 2035
页数:8
相关论文
共 56 条
[1]  
Adler R. J., 2010, I MATH STAT COLLECTI, V6, DOI [10.1214/10-IMSCOLL609, DOI 10.1214/10-IMSCOLL609]
[2]   AN INTRODUCTION TO KERNEL AND NEAREST-NEIGHBOR NONPARAMETRIC REGRESSION [J].
ALTMAN, NS .
AMERICAN STATISTICIAN, 1992, 46 (03) :175-185
[3]  
[Anonymous], 2013, FNN: Fast Nearest Neighbor Search Algorithms and Applications
[4]  
[Anonymous], 5 STANDF U LAB COMP
[5]   Genome3D: A viewer-model framework for integrating and visualizing multi-scale epigenomic information within a three-dimensional genome [J].
Asbury, Thomas M. ;
Mitman, Matt ;
Tang, Jijun ;
Zheng, W. Jim .
BMC BIOINFORMATICS, 2010, 11
[6]   Three-dimensional modeling of the P. falciparum genome during the erythrocytic cycle reveals a strong connection between genome architecture and gene expression [J].
Ay, Ferhat ;
Bunnik, Evelien M. ;
Varoquaux, Nelle ;
Bol, Sebastiaan M. ;
Prudhomme, Jacques ;
Vert, Jean-Philippe ;
Noble, William Stafford ;
Le Roch, Karine G. .
GENOME RESEARCH, 2014, 24 (06) :974-988
[7]   Spatial localization of co-regulated genes exceeds genomic gene clustering in the Saccharomyces cerevisiae genome [J].
Ben-Elazar, Shay ;
Yakhini, Zohar ;
Yanai, Itai .
NUCLEIC ACIDS RESEARCH, 2013, 41 (04) :2191-2201
[8]   PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES [J].
BERNSTEIN, FC ;
KOETZLE, TF ;
WILLIAMS, GJB ;
MEYER, EF ;
BRICE, MD ;
RODGERS, JR ;
KENNARD, O ;
SHIMANOUCHI, T ;
TASUMI, M .
JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) :535-542
[9]   Distance-based assessment of the localization of functional annotations in 3D genome reconstructions [J].
Capurso, Daniel ;
Segal, Mark R. .
BMC GENOMICS, 2014, 15
[10]   Genome-wide localization of the nuclear transport machinery couples transcriptional status and nuclear organization [J].
Casolari, JM ;
Brown, CR ;
Komili, S ;
West, J ;
Hieronymus, H ;
Silver, PA .
CELL, 2004, 117 (04) :427-439