Corset: enabling differential gene expression analysis for de novo assembled transcriptomes

被引:579
作者
Davidson, Nadia M. [1 ]
Oshlack, Alicia [1 ,2 ]
机构
[1] Royal Childrens Hosp, Murdoch Childrens Res Inst, Parkville, Vic 3052, Australia
[2] Univ Melbourne, Dept Genet, Melbourne, Vic, Australia
基金
澳大利亚国家健康与医学研究理事会;
关键词
RNA-SEQ DATA; GENERATION; READS; OPTIMIZATION; ANNOTATION; DISCOVERY; BLAST; WHEAT;
D O I
10.1186/s13059-014-0410-6
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Next generation sequencing has made it possible to perform differential gene expression studies in non-model organisms. For these studies, the need for a reference genome is circumvented by performing de novo assembly on the RNA-seq data. However, transcriptome assembly produces a multitude of contigs, which must be clustered into genes prior to differential gene expression detection. Here we present Corset, a method that hierarchically clusters contigs using shared reads and expression, then summarizes read counts to clusters, ready for statistical testing. Using a range of metrics, we demonstrate that Corset out-performs alternative methods. Corset is available from https://code.google.com/p/corset-project/.
引用
收藏
页数:14
相关论文
共 43 条
[41]   Velvet: Algorithms for de novo short read assembly using de Bruijn graphs [J].
Zerbino, Daniel R. ;
Birney, Ewan .
GENOME RESEARCH, 2008, 18 (05) :821-829
[42]   A Practical Comparison of De Novo Genome Assembly Software Tools for Next-Generation Sequencing Technologies [J].
Zhang, Wenyu ;
Chen, Jiajia ;
Yang, Yang ;
Tang, Yifei ;
Shang, Jing ;
Shen, Bairong .
PLOS ONE, 2011, 6 (03)
[43]   Optimizing de novo transcriptome assembly from short-read RNA-Seq data: a comparative study [J].
Zhao, Qiong-Yi ;
Wang, Yi ;
Kong, Yi-Meng ;
Luo, Da ;
Li, Xuan ;
Hao, Pei .
BMC BIOINFORMATICS, 2011, 12 :S2