Systematic comparison of RNA-Seq normalization methods using measurement error models

Cited by: 17
Authors
Sun, Zhaonan [1]
Zhu, Yu [1]
Affiliations
[1] Purdue Univ, Dept Stat, W Lafayette, IN 47906 USA
Keywords
TRANSCRIPTOME; EXPRESSION; ARRAYS; GENOME; BIASES;
DOI
10.1093/bioinformatics/bts497
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Motivation: Further advancement of RNA-Seq technology and its applications calls for the development of effective normalization methods for RNA-Seq data. Currently, normalization methods are compared and validated by their correlations with a chosen gold standard, usually gene expression measurements generated by a different technology or platform such as real-time reverse transcription polymerase chain reaction (qRT-PCR) or microarray. Although this approach is intuitive and easy to implement, it becomes statistically inadequate when the gold standard is itself subject to measurement error (ME). It is also uninformative, because the correlation of a normalization method with a gold standard says little about the actual quality of the normalized RNA-Seq measurements.
Results: We propose to use a system of ME models based on qRT-PCR, microarray and RNA-Seq gene expression data to compare and validate RNA-Seq normalization methods. This approach does not assume the existence of a gold standard. The performance of a normalization method is characterized by a group of parameters of the system, referred to as the performance parameters, which can be consistently estimated. Different normalization methods can thus be compared through their estimated performance parameters. We applied the proposed approach to compare five existing RNA-Seq normalization methods using the gene expression data of two RNA samples from the MicroArray Quality Control and Sequencing Quality Control projects, and gained insight into the strengths and weaknesses of these methods.
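The idea of comparing platforms without a gold standard can be illustrated with a minimal sketch. It is not the authors' model or estimator: it assumes a simple linear ME model in which each of three platforms measures the same latent log-expression with its own intercept, slope and independent error, fixes the slope of the first platform to 1 for identifiability, and recovers the remaining slopes and the error variances from pairwise covariances (method of moments). All names and numerical values below are hypothetical and the data are simulated.

```python
import numpy as np

def simulate(n=200_000, seed=0):
    """Simulate three error-prone measurements of one latent expression level.

    Hypothetical parameters: the slopes, intercepts and noise levels are
    made up for illustration and do not come from the paper.
    """
    rng = np.random.default_rng(seed)
    t = rng.normal(5.0, 2.0, n)                     # latent true log-expression
    x1 = t + rng.normal(0.0, 0.5, n)                # reference platform, slope fixed to 1
    x2 = 1.0 + 0.8 * t + rng.normal(0.0, 0.7, n)    # second platform
    x3 = -0.5 + 1.2 * t + rng.normal(0.0, 1.0, n)   # third platform
    return x1, x2, x3

def moment_estimates(x1, x2, x3):
    """Method-of-moments estimates under x_j = a_j + b_j * t + e_j with b_1 = 1.

    Uses Cov(x1, x2) = b2 * Var(t), Cov(x1, x3) = b3 * Var(t) and
    Cov(x2, x3) = b2 * b3 * Var(t), assuming mutually independent errors.
    """
    c = np.cov(np.vstack([x1, x2, x3]))
    var_t = c[0, 1] * c[0, 2] / c[1, 2]
    b2 = c[1, 2] / c[0, 2]
    b3 = c[1, 2] / c[0, 1]
    slopes = np.array([1.0, b2, b3])
    err_var = np.diag(c) - slopes**2 * var_t        # per-platform error variance
    return slopes, err_var, var_t

if __name__ == "__main__":
    slopes, err_var, var_t = moment_estimates(*simulate())
    print("slopes:", slopes)
    print("error variances:", err_var)
    print("latent variance:", var_t)
```

In this toy setting no platform is treated as error-free: each one gets its own estimated slope and error variance, which is the kind of quantity a performance parameter could summarize.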
Pages: 2584 - 2591
Page count: 8
Related papers
  • [21] Removing technical variability in RNA-seq data using conditional quantile normalization
    Hansen, Kasper D.
    Irizarry, Rafael A.
    Wu, Zhijin
    BIOSTATISTICS, 2012, 13 (02) : 204 - 216
  • [22] A comparison of per sample global scaling and per gene normalization methods for differential expression analysis of RNA-seq data
    Li, Xiaohong
    Brock, Guy N.
    Rouchka, Eric C.
    Cooper, Nigel G. F.
    Wu, Dongfeng
    O'Toole, Timothy E.
    Gill, Ryan S.
    Eteleeb, Abdallah M.
    O'Brien, Liz
    Rai, Shesh N.
    PLOS ONE, 2017, 12 (05)
  • [23] STATISTICAL CALIBRATION OF QRT-PCR, MICROARRAY AND RNA-SEQ GENE EXPRESSION DATA WITH MEASUREMENT ERROR MODELS
    Sun, Zhaonan
    Kuczek, Thomas
    Zhu, Yu
    ANNALS OF APPLIED STATISTICS, 2014, 8 (02) : 1022 - 1044
  • [24] Development and evaluation of RNA-seq methods
    Levin, Joshua
    Adiconis, Xian
    Yassour, Moran
    Thompson, Dawn
    Guttman, Mitchell
    Berger, Michael
    Fan, Lin
    Friedman, Nir
    Nusbaum, Chad
    Gnirke, Andreas
    Regev, Aviv
    GENOME BIOLOGY, 2010, 11
  • [25] RNA-Seq methods for transcriptome analysis
    Hrdlickova, Radmila
    Toloue, Masoud
    Tian, Bin
    WILEY INTERDISCIPLINARY REVIEWS-RNA, 2017, 8 (01)
  • [27] Comparison of normalization and differential expression analyses using RNA-Seq data from 726 individual Drosophila melanogaster
    Lin, Yanzhu
    Golovnina, Kseniya
    Chen, Zhen-Xia
    Lee, Hang Noh
    Negron, Yazmin L. Serrano
    Sultana, Hina
    Oliver, Brian
    Harbison, Susan T.
    BMC GENOMICS, 2016, 17
  • [29] Systematic Selection of Reference Genes for the Normalization of Circulating RNA Transcripts in Pregnant Women Based on RNA-Seq Data
    Chim, Stephen S. C.
    Wong, Karen K. W.
    Chung, Claire Y. L.
    Lam, Stephanie K. W.
    Kwok, Jamie S. L.
    Lai, Chit-Ying
    Cheng, Yvonne K. Y.
    Hui, Annie S. Y.
    Meng, Meng
    Chan, Oi-Ka
    Tsui, Stephen K. W.
    Lee, Keun-Young
    Chan, Ting-Fung
    Leung, Tak-Yeung
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2017, 18 (08)
  • [30] Computational methods for transcriptome annotation and quantification using RNA-seq
    Garber, Manuel
    Grabherr, Manfred G.
    Guttman, Mitchell
    Trapnell, Cole
    NATURE METHODS, 2011, 8 (06) : 469 - 477