mRIN for direct assessment of genome-wide and gene-specific mRNA integrity from large-scale RNA-sequencing data

Cited by: 42
Authors
Feng, Huijuan [1 ,2 ,3 ]
Zhang, Xuegong [1 ,2 ]
Zhang, Chaolin [3 ]
Affiliations
[1] Tsinghua Univ, MOE Key Lab Bioinformat, Beijing 100084, Peoples R China
[2] Tsinghua Univ, TNLIST, Bioinformat Div, Dept Automat, Beijing 100084, Peoples R China
[3] Columbia Univ, Dept Syst Biol, Dept Biochem & Mol Biophys, Ctr Motor Neuron Biol & Dis, New York, NY 10032 USA
Source
NATURE COMMUNICATIONS | 2015 / Volume 6
Funding
National Institutes of Health (USA);
Keywords
QUALITY-CONTROL; HUMAN BRAIN; SEQ DATA; ISOFORM EXPRESSION; DEGRADATION; DECAY; TRANSCRIPTOME; QUANTIFICATION; SITES; MOUSE;
DOI
10.1038/ncomms8816
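Chinese Library Classification
O [Mathematical sciences and chemistry]; P [Astronomy, earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification codes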
07 ; 0710 ; 09 ;
Abstract
The volume of RNA-Seq data sets in public repositories has been expanding exponentially, providing unprecedented opportunities to study gene expression regulation. Because degraded RNA samples, such as those collected from post-mortem tissues, can result in distinct expression profiles with potential biases, a particularly important step in mining these data is quality control. Here we develop a method named mRIN to directly assess mRNA integrity from RNA-Seq data at the sample and individual gene level. We systematically analyse large-scale RNA-Seq data sets of the human brain transcriptome generated by different consortia. Our analysis demonstrates that 3' bias resulting from partial RNA fragmentation in post-mortem tissues has a marked impact on global expression profiles, and that mRIN effectively identifies samples with different levels of mRNA degradation. Unexpectedly, this process has a reproducible and gene-specific component, and transcripts with different stabilities are associated with distinct functions and structural features reminiscent of mRNA decay in living cells.
Pages: 10
Related papers
9 in total
  • [1] A high-efficiency differential expression method for cancer heterogeneity using large-scale single-cell RNA-sequencing data
    Yuan, Xin
    Ma, Shuangge
    Fa, Botao
    Wei, Ting
    Ma, Yanran
    Wang, Yifan
    Lv, Wenwen
    Zhang, Yue
    Zheng, Junke
    Chen, Guoqiang
    Sun, Jing
    Yu, Zhangsheng
    FRONTIERS IN GENETICS, 2022, 13
  • [2] Integrating Genome-Wide Association Study with RNA-Sequencing Reveals HDAC9 as a Candidate Gene Influencing Loin Muscle Area in Beijing Black Pigs
    Hou, Renda
    Chen, Li
    Liu, Xiance
    Liu, Hai
    Shi, Guohua
    Hou, Xinhua
    Zhang, Run
    Yang, Man
    Niu, Naiqi
    Wang, Lixian
    Zhang, Longchao
    BIOLOGY-BASEL, 2022, 11 (11):
  • [3] Comparison of Genome-Wide and Gene-Specific DNA Methylation Profiling in First-Trimester Chorionic Villi From Pregnancies Conceived With Infertility Treatments
    Xu, Ning
    Barlow, Gillian M.
    Cui, Jinrui
    Wang, Erica T.
    Lee, Bora
    Akhlaghpour, Marzieh
    Kroener, Lindsay
    Williams, John, III
    Rotter, Jerome I.
    Chen, Yii-der I.
    Goodarzi, Mark O.
    Pisarska, Margareta D.
    REPRODUCTIVE SCIENCES, 2017, 24 (07) : 996 - 1004
  • [4] Leveraging large-scale multi-omics evidences to identify therapeutic targets from genome-wide association studies
    Lessard, Samuel
    Chao, Michael
    Reis, Kadri
    Beauvais, Mathieu
    Rajpal, Deepak K.
    Sloane, Jennifer
    Palta, Priit
    Klinger, Katherine
    de Rinaldis, Emanuele
    Shameer, Khader
    Chatelain, Clement
    BMC GENOMICS, 2024, 25 (01):
  • [5] CHESS: a new human gene catalog curated from thousands of large-scale RNA sequencing experiments reveals extensive transcriptional noise
    Pertea, Mihaela
    Shumate, Alaina
    Pertea, Geo
    Varabyou, Ales
    Breitwieser, Florian P.
    Chang, Yu-Chi
    Madugundu, Anil K.
    Pandey, Akhilesh
    Salzberg, Steven L.
    GENOME BIOLOGY, 2018, 19
  • [6] Construction of PRDM9 allele-specific recombination maps in cattle using large-scale pedigree analysis and genome-wide single sperm genomics
    Zhou, Yang
    Shen, Botong
    Jiang, Jicai
    Padhi, Abinash
    Park, Ki-Eun
    Oswalt, Adam
    Sattler, Charles G.
    Telugu, Bhanu P.
    Chen, Hong
    Cole, John B.
    Liu, George E.
    Ma, Li
    DNA RESEARCH, 2018, 25 (02) : 183 - 194
  • [7] CHESS: a new human gene catalog curated from thousands of large-scale RNA sequencing experiments reveals extensive transcriptional noise
    Mihaela Pertea
    Alaina Shumate
    Geo Pertea
    Ales Varabyou
    Florian P. Breitwieser
    Yu-Chi Chang
    Anil K. Madugundu
    Akhilesh Pandey
    Steven L. Salzberg
    Genome Biology, 19
  • [8] Large-Scale Development of Gene-Associated Single-Nucleotide Polymorphism Markers for Molluscan Population Genomic, Comparative Genomic, and Genome-Wide Association Studies
    Jiao, Wenqian
    Fu, Xiaoteng
    Li, Jinqin
    Li, Ling
    Feng, Liying
    Lv, Jia
    Zhang, Lu
    Wang, Xiaojian
    Li, Yangping
    Hou, Rui
    Zhang, Lingling
    Hu, Xiaoli
    Wang, Shi
    Bao, Zhenmin
    DNA RESEARCH, 2014, 21 (02) : 183 - 193
  • [9] Post-modified non-negative matrix factorization for deconvoluting the gene expression profiles of specific cell types from heterogeneous clinical samples based on RNA-sequencing data
    Liu, Yuan
    Liang, Yu
    Kuang, Qifan
    Xie, Fanfan
    Hao, Yingyi
    Wen, Zhining
    Li, Menglong
    JOURNAL OF CHEMOMETRICS, 2018, 32 (11)