An integrated model for detecting significant chromatin interactions from high-resolution Hi-C data

被引:50
作者
Carty, Mark [1 ,2 ,3 ]
Zamparo, Lee [1 ]
Sahin, Merve [1 ,3 ]
Gonzalez, Alvaro [1 ]
Pelossof, Raphael [1 ]
Elemento, Olivier [2 ]
Leslie, Christina S. [1 ]
机构
[1] Mem Sloan Kettering Canc Ctr, Computat Biol Program, New York, NY 10065 USA
[2] Weill Cornell Med Coll, Inst Computat Biomed, New York, NY 10065 USA
[3] Tri Inst Training Program Computat Biol & Med, New York, NY 10065 USA
关键词
SINGLE-CELL; REVEALS; PRINCIPLES; GENOME;
D O I
10.1038/ncomms15454
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Here we present HiC-DC, a principled method to estimate the statistical significance (P values) of chromatin interactions from Hi-C experiments. HiC-DC uses hurdle negative binomial regression account for systematic sources of variation in Hi-C read counts-for example, distance-dependent random polymer ligation and GC content and mappability bias-and model zero inflation and overdispersion. Applied to high-resolution Hi-C data in a lymphoblastoid cell line, HiC-DC detects significant interactions at the sub-topologically associating domain level, identifying potential structural and regulatory interactions supported by CTCF binding sites, DNase accessibility, and/or active histone marks. CTCF-associated interactions are most strongly enriched in the middle genomic distance range (similar to 700 kb-1.5Mb), while interactions involving actively marked DNase accessible elements are enriched both at short (<500 kb) and longer (>1.5 Mb) genomic distances. There is a striking enrichment of longer-range interactions connecting replication-dependent histone genes on chromosome 6, potentially representing the chromatin architecture at the histone locus body.
引用
收藏
页数:10
相关论文
共 20 条
[1]   Statistical confidence estimation for Hi-C data reveals regulatory chromatin contacts [J].
Ay, Ferhat ;
Bailey, Timothy L. ;
Noble, William Stafford .
GENOME RESEARCH, 2014, 24 (06) :999-1011
[2]   CHiCAGO: robust detection of DNA looping interactions in Capture Hi-C data [J].
Cairns, Jonathan ;
Freire-Pritchett, Paula ;
Wingett, Steven W. ;
Varnai, Csilla ;
Dimond, Andrew ;
Plagnol, Vincent ;
Zerbino, Daniel ;
Schoenfelder, Stefan ;
Javierre, Biola-Maria ;
Osborne, Cameron ;
Fraser, Peter ;
Spivakov, Mikhail .
GENOME BIOLOGY, 2016, 17
[3]   Topological domains in mammalian genomes identified by analysis of chromatin interactions [J].
Dixon, Jesse R. ;
Selvaraj, Siddarth ;
Yue, Feng ;
Kim, Audrey ;
Li, Yan ;
Shen, Yin ;
Hu, Ming ;
Liu, Jun S. ;
Ren, Bing .
NATURE, 2012, 485 (7398) :376-380
[4]   A three-dimensional model of the yeast genome [J].
Duan, Zhijun ;
Andronescu, Mirela ;
Schutz, Kevin ;
McIlwain, Sean ;
Kim, Yoo Jung ;
Lee, Choli ;
Shendure, Jay ;
Fields, Stanley ;
Blau, C. Anthony ;
Noble, William S. .
NATURE, 2010, 465 (7296) :363-367
[5]   ChromHMM: automating chromatin-state discovery and characterization [J].
Ernst, Jason ;
Kellis, Manolis .
NATURE METHODS, 2012, 9 (03) :215-216
[6]  
Hoffman MM, 2012, NAT METHODS, V9, P473, DOI [10.1038/NMETH.1937, 10.1038/nmeth.1937]
[7]   HiCNorm: removing biases in Hi-C data via Poisson regression [J].
Hu, Ming ;
Deng, Ke ;
Selvaraj, Siddarth ;
Qin, Zhaohui ;
Ren, Bing ;
Liu, Jun S. .
BIOINFORMATICS, 2012, 28 (23) :3131-3133
[8]  
Imakaev M, 2012, NAT METHODS, V9, P999, DOI [10.1038/nmeth.2148, 10.1038/NMETH.2148]
[9]   FourCSeq: analysis of 4C sequencing data [J].
Klein, Felix A. ;
Pakozdi, Tibor ;
Anders, Simon ;
Ghavi-Helm, Yad ;
Furlong, Eileen E. M. ;
Huber, Wolfgang .
BIOINFORMATICS, 2015, 31 (19) :3085-3091
[10]   Comprehensive Mapping of Long-Range Interactions Reveals Folding Principles of the Human Genome [J].
Lieberman-Aiden, Erez ;
van Berkum, Nynke L. ;
Williams, Louise ;
Imakaev, Maxim ;
Ragoczy, Tobias ;
Telling, Agnes ;
Amit, Ido ;
Lajoie, Bryan R. ;
Sabo, Peter J. ;
Dorschner, Michael O. ;
Sandstrom, Richard ;
Bernstein, Bradley ;
Bender, M. A. ;
Groudine, Mark ;
Gnirke, Andreas ;
Stamatoyannopoulos, John ;
Mirny, Leonid A. ;
Lander, Eric S. ;
Dekker, Job .
SCIENCE, 2009, 326 (5950) :289-293