Optimizing a Massive Parallel Sequencing Workflow for Quantitative miRNA Expression Analysis

被引:28
作者
Cordero, Francesca [1 ,2 ]
Beccuti, Marco [2 ]
Arigoni, Maddalena [1 ]
Donatelli, Susanna [2 ]
Calogero, Raffaele A. [1 ]
机构
[1] Univ Turin, Dept Comp Sci, Turin, Italy
[2] Univ Turin, Ctr Mol Biotechnol, Turin, Italy
关键词
DIFFERENTIAL EXPRESSION; BIOCONDUCTOR PACKAGE; RAPID ALIGNMENT; SEQ; GENES; NORMALIZATION; TOOLS; BIAS;
D O I
10.1371/journal.pone.0031630
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Massive Parallel Sequencing methods (MPS) can extend and improve the knowledge obtained by conventional microarray technology, both for mRNAs and short non-coding RNAs, e. g. miRNAs. The processing methods used to extract and interpret the information are an important aspect of dealing with the vast amounts of data generated from short read sequencing. Although the number of computational tools for MPS data analysis is constantly growing, their strengths and weaknesses as part of a complex analytical pipe-line have not yet been well investigated. Primary findings: A benchmark MPS miRNA dataset, resembling a situation in which miRNAs are spiked in biological replication experiments was assembled by merging a publicly available MPS spike-in miRNAs data set with MPS data derived from healthy donor peripheral blood mononuclear cells. Using this data set we observed that short reads counts estimation is strongly under estimated in case of duplicates miRNAs, if whole genome is used as reference. Furthermore, the sensitivity of miRNAs detection is strongly dependent by the primary tool used in the analysis. Within the six aligners tested, specifically devoted to miRNA detection, SHRiMP and MicroRazerS show the highest sensitivity. Differential expression estimation is quite efficient. Within the five tools investigated, two of them (DESseq, baySeq) show a very good specificity and sensitivity in the detection of differential expression. Conclusions: The results provided by our analysis allow the definition of a clear and simple analytical optimized workflow for miRNAs digital quantitative analysis.
引用
收藏
页数:10
相关论文
共 34 条
[1]   Barcoding bias in high-throughput multiplex sequencing of miRNA [J].
Alon, Shahar ;
Vigneault, Francois ;
Eminaga, Seda ;
Christodoulou, Danos C. ;
Seidman, Jonathan G. ;
Church, George M. ;
Eisenberg, Eli .
GENOME RESEARCH, 2011, 21 (09) :1506-1511
[2]   A uniform system for microRNA annotation [J].
Ambros, V ;
Bartel, B ;
Bartel, DP ;
Burge, CB ;
Carrington, JC ;
Chen, XM ;
Dreyfuss, G ;
Eddy, SR ;
Griffiths-Jones, S ;
Marshall, M ;
Matzke, M ;
Ruvkun, G ;
Tuschl, T .
RNA, 2003, 9 (03) :277-279
[3]  
Anders S., 2010, GENOME BIOL, V11, pR106, DOI [10.1186/gb-2010-11-10-r106, DOI 10.1186/gb-2010-11-10-r106]
[4]   Rank products: a simple, yet powerful, new method to detect differentially regulated genes in replicated microarray experiments [J].
Breitling, R ;
Armengaud, P ;
Amtmann, A ;
Herzyk, P .
FEBS LETTERS, 2004, 573 (1-3) :83-92
[5]   Evaluation of statistical methods for normalization and differential expression in mRNA-Seq experiments [J].
Bullard, James H. ;
Purdom, Elizabeth ;
Hansen, Kasper D. ;
Dudoit, Sandrine .
BMC BIOINFORMATICS, 2010, 11
[6]  
Califano A, 1993, Proc Int Conf Intell Syst Mol Biol, V1, P56
[7]   Dissecting an alternative splicing analysis workflow for GeneChip® Exon 1.0 ST Affymetrix arrays [J].
Della Beffa, Cristina ;
Cordero, Francesca ;
Calogero, Raffaele A. .
BMC GENOMICS, 2008, 9 (1)
[8]   MicroRazerS: rapid alignment of small RNA reads [J].
Emde, Anne-Katrin ;
Grunert, Marcel ;
Weese, David ;
Reinert, Knut ;
Sperling, Silke R. .
BIOINFORMATICS, 2010, 26 (01) :123-124
[9]   Striped Smith-Waterman speeds database searches six times over other SIMD implementations [J].
Farrar, Michael .
BIOINFORMATICS, 2007, 23 (02) :156-161
[10]   Discovering microRNAs from deep sequencing data using miRDeep [J].
Friedlaender, Marc R. ;
Chen, Wei ;
Adamidi, Catherine ;
Maaskola, Jonas ;
Einspanier, Ralf ;
Knespel, Signe ;
Rajewsky, Nikolaus .
NATURE BIOTECHNOLOGY, 2008, 26 (04) :407-415