Comparison and calibration of transcriptome data from RNA-Seq and tiling arrays

被引:78
|
作者
Agarwal, Ashish [1 ]
Koppstein, David [1 ]
Rozowsky, Joel [1 ]
Sboner, Andrea [1 ]
Habegger, Lukas [1 ]
Hillier, LaDeana W. [3 ]
Sasidharan, Rajkumar [1 ]
Reinke, Valerie [4 ]
Waterston, Robert H. [3 ]
Gerstein, Mark [1 ,2 ]
机构
[1] Yale Univ, Dept Mol Biophys & Biochem, New Haven, CT 06520 USA
[2] Yale Univ, Program Computat Biol & Bioinformat, New Haven, CT 06520 USA
[3] Univ Washington, Sch Med, Dept Genome Sci, Seattle, WA 98195 USA
[4] Yale Univ, Sch Med, Dept Genet, New Haven, CT 06520 USA
来源
BMC GENOMICS | 2010年 / 11卷
关键词
GENE-EXPRESSION; EUKARYOTIC TRANSCRIPTOME; MICROARRAY; IDENTIFICATION; NORMALIZATION; ANNOTATION; PREDICTION;
D O I
10.1186/1471-2164-11-383
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Tiling arrays have been the tool of choice for probing an organism's transcriptome without prior assumptions about the transcribed regions, but RNA-Seq is becoming a viable alternative as the costs of sequencing continue to decrease. Understanding the relative merits of these technologies will help researchers select the appropriate technology for their needs. Results: Here, we compare these two platforms using a matched sample of poly(A)-enriched RNA isolated from the second larval stage of C. elegans. We find that the raw signals from these two technologies are reasonably well correlated but that RNA-Seq outperforms tiling arrays in several respects, notably in exon boundary detection and dynamic range of expression. By exploring the accuracy of sequencing as a function of depth of coverage, we found that about 4 million reads are required to match the sensitivity of two tiling array replicates. The effects of cross-hybridization were analyzed using a "nearest neighbor" classifier applied to array probes; we describe a method for determining potential "black list" regions whose signals are unreliable. Finally, we propose a strategy for using RNA-Seq data as a gold standard set to calibrate tiling array data. All tiling array and RNA-Seq data sets have been submitted to the modENCODE Data Coordinating Center. Conclusions: Tiling arrays effectively detect transcript expression levels at a low cost for many species while RNA-Seq provides greater accuracy in several regards. Researchers will need to carefully select the technology appropriate to the biological investigations they are undertaking. It will also be important to reconsider a comparison such as ours as sequencing technologies continue to evolve.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Comparison of the myometrial transcriptome from singleton and twin pregnancies by RNA-Seq
    Arrowsmith, Sarah
    Fang, Yongxiang
    Sharp, Andrew
    PLOS ONE, 2020, 15 (01):
  • [2] Logic programming to infer complex RNA expression patterns from RNA-seq data
    Weirick, Tyler
    Militello, Giuseppe
    Ponomareva, Yuliya
    John, David
    Doring, Claudia
    Dimmeler, Stefanie
    Uchida, Shizuka
    BRIEFINGS IN BIOINFORMATICS, 2018, 19 (02) : 199 - 209
  • [3] Comparison of RNA-Seq and Microarray in Transcriptome Profiling of Activated T Cells
    Zhao, Shanrong
    Fung-Leung, Wai-Ping
    Bittner, Anton
    Ngo, Karen
    Liu, Xuejun
    PLOS ONE, 2014, 9 (01):
  • [4] Ancestral transcriptome inference based on RNA-Seq and ChIP-seq data
    Yang, Jingwen
    Ruan, Hang
    Zou, Yangyun
    Su, Zhixi
    Gu, Xun
    METHODS, 2020, 176 : 99 - 105
  • [5] An empirical strategy to detect bacterial transcript structure from directional RNA-seq transcriptome data
    Wang, Yejun
    MacKenzie, Keith D.
    White, Aaron P.
    BMC GENOMICS, 2015, 16
  • [6] A comparison of methods for differential expression analysis of RNA-seq data
    Soneson, Charlotte
    Delorenzi, Mauro
    BMC BIOINFORMATICS, 2013, 14
  • [7] RNA-Seq methods for transcriptome analysis
    Hrdlickova, Radmila
    Toloue, Masoud
    Tian, Bin
    WILEY INTERDISCIPLINARY REVIEWS-RNA, 2017, 8 (01)
  • [8] A new method for stranded whole transcriptome RNA-seq
    Miller, David F. B.
    Yan, Pearlly S.
    Buechlein, Aaron
    Rodriguez, Benjamin A.
    Yilmaz, Ayse S.
    Goel, Shokhi
    Lin, Hai
    Collins-Burow, Bridgette
    Rhodes, Lyndsay V.
    Braun, Chris
    Pradeep, Sunila
    Rupaimoole, Rajesha
    Dalkilic, Mehmet
    Sood, Anil K.
    Burow, Matthew E.
    Tang, Haixu
    Huang, Tim H.
    Liu, Yunlong
    Rusch, Douglas B.
    Nephew, Kenneth P.
    METHODS, 2013, 63 (02) : 126 - 134
  • [9] Transcriptome analysis of rice root heterosis by RNA-Seq
    Zhai, Rongrong
    Feng, Yue
    Wang, Huimin
    Zhan, Xiaodeng
    Shen, Xihong
    Wu, Weiming
    Zhang, Yingxin
    Chen, Daibo
    Dai, Gaoxing
    Yang, Zhanlie
    Cao, Liyong
    Cheng, Shihua
    BMC GENOMICS, 2013, 14
  • [10] A Framework for Comparison and Assessment of Synthetic RNA-Seq Data
    Shakola, Felitsiya
    Palejev, Dean
    Ivanov, Ivan
    GENES, 2022, 13 (12)