Cicero Predicts cis-Regulatory DNA Interactions from Single-Cell Chromatin Accessibility Data

Cited by: 461
Authors
Pliner, Hannah A. [1]
Packer, Jonathan S. [1]
McFaline-Figueroa, Jose L. [1]
Cusanovich, Darren A. [1]
Daza, Riza M. [1]
Aghamirzaie, Delasa [1]
Srivatsan, Sanjay [1]
Qiu, Xiaojie [1, 2]
Jackson, Dana [1]
Minkina, Anna [1]
Adey, Andrew C. [3]
Steemers, Frank J. [4]
Shendure, Jay [1, 5, 6]
Trapnell, Cole [1, 6]
Affiliations
[1] Univ Washington, Dept Genome Sci, Seattle, WA 98195 USA
[2] Univ Washington, Mol & Cellular Biol Program, Seattle, WA 98195 USA
[3] Oregon Hlth & Sci Univ, Dept Mol & Med Genet, Portland, OR 97201 USA
[4] Illumina Inc, San Diego, CA USA
[5] Howard Hughes Med Inst, Seattle, WA 98195 USA
[6] Brotman Baty Inst Precis Med, Seattle, WA 98195 USA
Keywords
GENE-EXPRESSION; READ ALIGNMENT; HUMAN GENOME; MYOD; BINDING; ACTIVATION; P300; TRANSPOSITION; ORGANIZATION; PROTEINS
DOI
10.1016/j.molcel.2018.06.044
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Linking regulatory DNA elements to their target genes, which may be located hundreds of kilobases away, remains challenging. Here, we introduce Cicero, an algorithm that identifies co-accessible pairs of DNA elements using single-cell chromatin accessibility data and so connects regulatory elements to their putative target genes. We apply Cicero to investigate how dynamically accessible elements orchestrate gene regulation in differentiating myoblasts. Groups of Cicero-linked regulatory elements meet criteria of "chromatin hubs": they are enriched for physical proximity, interact with a common set of transcription factors, and undergo coordinated changes in histone marks that are predictive of changes in gene expression. Pseudotemporal analysis revealed that most DNA elements remain in chromatin hubs throughout differentiation. A subset of elements bound by MYOD1 in myoblasts exhibit early opening in a PBX1- and MEIS1-dependent manner. Our strategy can be applied to dissect the architecture, sequence determinants, and mechanisms of cis-regulation on a genome-wide scale.
Pages: 858 / +
Page count: 22