Comparison and calibration of transcriptome data from RNA-Seq and tiling arrays

被引:78
|
作者
Agarwal, Ashish [1 ]
Koppstein, David [1 ]
Rozowsky, Joel [1 ]
Sboner, Andrea [1 ]
Habegger, Lukas [1 ]
Hillier, LaDeana W. [3 ]
Sasidharan, Rajkumar [1 ]
Reinke, Valerie [4 ]
Waterston, Robert H. [3 ]
Gerstein, Mark [1 ,2 ]
机构
[1] Yale Univ, Dept Mol Biophys & Biochem, New Haven, CT 06520 USA
[2] Yale Univ, Program Computat Biol & Bioinformat, New Haven, CT 06520 USA
[3] Univ Washington, Sch Med, Dept Genome Sci, Seattle, WA 98195 USA
[4] Yale Univ, Sch Med, Dept Genet, New Haven, CT 06520 USA
来源
BMC GENOMICS | 2010年 / 11卷
关键词
GENE-EXPRESSION; EUKARYOTIC TRANSCRIPTOME; MICROARRAY; IDENTIFICATION; NORMALIZATION; ANNOTATION; PREDICTION;
D O I
10.1186/1471-2164-11-383
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Tiling arrays have been the tool of choice for probing an organism's transcriptome without prior assumptions about the transcribed regions, but RNA-Seq is becoming a viable alternative as the costs of sequencing continue to decrease. Understanding the relative merits of these technologies will help researchers select the appropriate technology for their needs. Results: Here, we compare these two platforms using a matched sample of poly(A)-enriched RNA isolated from the second larval stage of C. elegans. We find that the raw signals from these two technologies are reasonably well correlated but that RNA-Seq outperforms tiling arrays in several respects, notably in exon boundary detection and dynamic range of expression. By exploring the accuracy of sequencing as a function of depth of coverage, we found that about 4 million reads are required to match the sensitivity of two tiling array replicates. The effects of cross-hybridization were analyzed using a "nearest neighbor" classifier applied to array probes; we describe a method for determining potential "black list" regions whose signals are unreliable. Finally, we propose a strategy for using RNA-Seq data as a gold standard set to calibrate tiling array data. All tiling array and RNA-Seq data sets have been submitted to the modENCODE Data Coordinating Center. Conclusions: Tiling arrays effectively detect transcript expression levels at a low cost for many species while RNA-Seq provides greater accuracy in several regards. Researchers will need to carefully select the technology appropriate to the biological investigations they are undertaking. It will also be important to reconsider a comparison such as ours as sequencing technologies continue to evolve.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Comparison and calibration of transcriptome data from RNA-Seq and tiling arrays
    Ashish Agarwal
    David Koppstein
    Joel Rozowsky
    Andrea Sboner
    Lukas Habegger
    LaDeana W Hillier
    Rajkumar Sasidharan
    Valerie Reinke
    Robert H Waterston
    Mark Gerstein
    BMC Genomics, 11
  • [2] Comparison of the myometrial transcriptome from singleton and twin pregnancies by RNA-Seq
    Arrowsmith, Sarah
    Fang, Yongxiang
    Sharp, Andrew
    PLOS ONE, 2020, 15 (01):
  • [3] Evaluation of de novo transcriptome assemblies from RNA-Seq data
    Li, Bo
    Fillmore, Nathanael
    Bai, Yongsheng
    Collins, Mike
    Thomson, James A.
    Stewart, Ron
    Dewey, Colin N.
    GENOME BIOLOGY, 2014, 15 (12):
  • [4] Evaluation of de novo transcriptome assemblies from RNA-Seq data
    Bo Li
    Nathanael Fillmore
    Yongsheng Bai
    Mike Collins
    James A Thomson
    Ron Stewart
    Colin N Dewey
    Genome Biology, 15
  • [5] Transcriptome assembly and quantification from Ion Torrent RNA-Seq data
    Mangul, Serghei
    Caciula, Adrian
    Al Seesi, Sahar
    Brinza, Dumitru
    Mondoiu, Ion
    Zelikovsky, Alex
    BMC GENOMICS, 2014, 15
  • [6] Transcriptome assembly and quantification from Ion Torrent RNA-Seq data
    Serghei Mangul
    Adrian Caciula
    Sahar Al Seesi
    Dumitru Brinza
    Ion Mӑndoiu
    Alex Zelikovsky
    BMC Genomics, 15
  • [7] Taming the Transcriptome with RNA-Seq
    Stein R.A.
    2018, Mary Ann Liebert Inc. (38): : 10 - 12
  • [8] RNA-seq dissects the transcriptome
    Genet. Eng. Biotechnol. News, 13 (20-22):
  • [9] Escherichia coli transcriptome assembly from a compendium of RNA-seq data sets
    Tjaden, Brian
    RNA BIOLOGY, 2023, 20 (01) : 77 - 84
  • [10] Ancestral transcriptome inference based on RNA-Seq and ChIP-seq data
    Yang, Jingwen
    Ruan, Hang
    Zou, Yangyun
    Su, Zhixi
    Gu, Xun
    METHODS, 2020, 176 : 99 - 105