共 43 条
Corset: enabling differential gene expression analysis for de novo assembled transcriptomes
被引:579
作者:
Davidson, Nadia M.
[1
]
Oshlack, Alicia
[1
,2
]
机构:
[1] Royal Childrens Hosp, Murdoch Childrens Res Inst, Parkville, Vic 3052, Australia
[2] Univ Melbourne, Dept Genet, Melbourne, Vic, Australia
基金:
澳大利亚国家健康与医学研究理事会;
关键词:
RNA-SEQ DATA;
GENERATION;
READS;
OPTIMIZATION;
ANNOTATION;
DISCOVERY;
BLAST;
WHEAT;
D O I:
10.1186/s13059-014-0410-6
中图分类号:
Q81 [生物工程学(生物技术)];
Q93 [微生物学];
学科分类号:
071005 ;
0836 ;
090102 ;
100705 ;
摘要:
Next generation sequencing has made it possible to perform differential gene expression studies in non-model organisms. For these studies, the need for a reference genome is circumvented by performing de novo assembly on the RNA-seq data. However, transcriptome assembly produces a multitude of contigs, which must be clustered into genes prior to differential gene expression detection. Here we present Corset, a method that hierarchically clusters contigs using shared reads and expression, then summarizes read counts to clusters, ready for statistical testing. Using a range of metrics, we demonstrate that Corset out-performs alternative methods. Corset is available from https://code.google.com/p/corset-project/.
引用
收藏
页数:14
相关论文