Regulatory activity is the default DNA state in eukaryotes

被引:7
作者
Luthra, Ishika [1 ]
Jensen, Cassandra [1 ]
Chen, Xinyi E. [1 ]
Salaudeen, Asfar Lathif [1 ]
Rafi, Abdul Muntakim [1 ]
de Boer, Carl G. [1 ]
机构
[1] Univ British Columbia, Sch Biomed Engn, Vancouver, BC, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
HORIZONTAL-GENE-TRANSFER; LONG NONCODING RNAS; SEQUENCE; EVOLUTION; REVEALS; GENOME; PROTEINS; FRACTION; ENCODE;
D O I
10.1038/s41594-024-01235-4
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Genomes encode for genes and non-coding DNA, both capable of transcriptional activity. However, unlike canonical genes, many transcripts from non-coding DNA have limited evidence of conservation or function. Here, to determine how much biological noise is expected from non-genic sequences, we quantify the regulatory activity of evolutionarily naive DNA using RNA-seq in yeast and computational predictions in humans. In yeast, more than 99% of naive DNA bases were transcribed. Unlike the evolved transcriptome, naive transcripts frequently overlapped with opposite sense transcripts, suggesting selection favored coherent gene structures in the yeast genome. In humans, regulation-associated chromatin activity is predicted to be common in naive dinucleotide-content-matched randomized DNA. Here, naive and evolved DNA have similar co-occurrence and cell-type specificity of chromatin marks, challenging these as indicators of selection. However, in both yeast and humans, extreme high activities were rare in naive DNA, suggesting they result from selection. Overall, basal regulatory activity seems to be the default, which selection can hone to evolve a function or, if detrimental, repress. Here, the authors ask how much regulatory activity DNA is expected to have in the absence of selection. In yeast and humans, they find that gene regulatory activity is common in evolutionarily naive DNA, suggesting that activity is not always indicative of function.
引用
收藏
页码:559 / 567
页数:25
相关论文
共 86 条
[1]   Predicting mRNA Abundance Directly from Genomic Sequence Using Deep Convolutional Neural Networks [J].
Agarwal, Vikram ;
Shendure, Jay .
CELL REPORTS, 2020, 31 (07)
[2]  
Andrews S, 2010, FastQC: A quality control tool for high throughput sequence data
[3]   Effective gene expression prediction from sequence by integrating long-range interactions [J].
Avsec, Ziga ;
Agarwal, Vikram ;
Visentin, Daniel ;
Ledsam, Joseph R. ;
Grabska-Barwinska, Agnieszka ;
Taylor, Kyle R. ;
Assael, Yannis ;
Jumper, John ;
Kohli, Pushmeet ;
Kelley, David R. .
NATURE METHODS, 2021, 18 (10) :1196-+
[4]  
Bailey TL., 2021, BIORXIV, DOI DOI 10.1101/2021.08.23.457422
[5]   DNA METHYLATION AND THE FREQUENCY OF CPG IN ANIMAL DNA [J].
BIRD, AP .
NUCLEIC ACIDS RESEARCH, 1980, 8 (07) :1499-1504
[6]  
Blackburn, YEAST COLONY PCR V20
[7]   Uncovering de novo gene birth in yeast using deep transcriptomics [J].
Blevins, William R. ;
Ruiz-Orera, Jorge ;
Messeguer, Xavier ;
Blasco-Moreno, Bernat ;
Luis Villanueva-Canas, Jose ;
Espinar, Lorena ;
Diez, Juana ;
Carey, Lucas B. ;
Mar Alba, M. .
NATURE COMMUNICATIONS, 2021, 12 (01)
[8]   A Deep Neural Network for Predicting and Engineering Alternative Polyadenylation [J].
Bogard, Nicholas ;
Linder, Johannes ;
Rosenberg, Alexander B. ;
Seelig, Georg .
CELL, 2019, 178 (01) :91-+
[9]   Trimmomatic: a flexible trimmer for Illumina sequence data [J].
Bolger, Anthony M. ;
Lohse, Marc ;
Usadel, Bjoern .
BIOINFORMATICS, 2014, 30 (15) :2114-2120
[10]   Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses [J].
Cabili, Moran N. ;
Trapnell, Cole ;
Goff, Loyal ;
Koziol, Magdalena ;
Tazon-Vega, Barbara ;
Regev, Aviv ;
Rinn, John L. .
GENES & DEVELOPMENT, 2011, 25 (18) :1915-1927