Bootstrap-based differential gene expression analysis for RNA-Seq data with and without replicates

被引：29

作者：

Al Seesi, Sahar ^{[1
]}

Tiagueu, Yvette Temate ^{[2
]}

Zelikovsky, Alexander ^{[2
]}

Mandoiu, Ion I. ^{[1
]}

机构：

[1] Univ Connecticut, Dept Comp Engn & Sci, Storrs, CT 06269 USA

[2] Georgia State Univ, Dept Comp Sci, Atlanta, GA 30303 USA

来源：

BMC GENOMICS | 2014年 / 15卷

基金：

美国食品与农业研究所; 美国国家科学基金会;

关键词：

SPLICING ISOFORM FREQUENCIES;

D O I：

10.1186/1471-2164-15-S8-S2

中图分类号：

Q81 [生物工程学（生物技术）]; Q93 [微生物学];

学科分类号：

071005 ; 0836 ; 090102 ; 100705 ;

摘要：

A major application of RNA-Seq is to perform differential gene expression analysis. Many tools exist to analyze differentially expressed genes in the presence of biological replicates. Frequently, however, RNA-Seq experiments have no or very few biological replicates and development of methods for detecting differentially expressed genes in these scenarios is still an active research area. In this paper we introduce a novel method, called IsoDE, for differential gene expression analysis based on bootstrapping. We compared IsoDE against four existing methods (Fisher's exact test, GFOLD, edgeR and Cuffdiff) on RNA-Seq datasets generated using three different sequencing technologies, both with and without replicates. Experiments on MAQC RNA-Seq datasets without replicates show that IsoDE has consistently high accuracy as defined by the qPCR ground truth, frequently higher than that of the compared methods, particularly for low coverage data and at lower fold change thresholds. In experiments on RNA-Seq datasets with up to 7 replicates, IsoDE has also achieved high accuracy. Furthermore, unlike GFOLD and edgeR, IsoDE accuracy varies smoothly with the number of replicates, and is relatively uniform across the entire range of gene expression levels. The proposed non-parametric method based on bootstrapping has practical running time, and achieves robust performance over a broad range of technologies, number of replicates, sequencing depths, and minimum fold change thresholds.

引用

页数：10

共 50 条

[1] Bootstrap-based differential gene expression analysis for RNA-Seq data with and without replicates
Sahar Al Seesi
Yvette Temate Tiagueu
Alexander Zelikovsky
Ion I Măndoiu
BMC Genomics, 15
[2] CORNAS: coverage-dependent RNA-Seq analysis of gene expression data without biological replicates
Low, Joel Z. B.
Khang, Tsung Fei
Tammi, Martti T.
BMC BIOINFORMATICS, 2017, 18
[3] CORNAS: coverage-dependent RNA-Seq analysis of gene expression data without biological replicates
Joel Z. B. Low
Tsung Fei Khang
Martti T. Tammi
BMC Bioinformatics, 18
[4] On Differential Gene Expression Using RNA-Seq Data
Lee, Juhee
Ji, Yuan
Liang, Shoudan
Cai, Guoshuai
Mueller, Peter
CANCER INFORMATICS, 2011, 10 : 205 - 215
[5] Differential gene expression analysis using coexpression and RNA-Seq data
Yang, Ei-Wen
Girke, Thomas
Jiang, Tao
BIOINFORMATICS, 2013, 29 (17) : 2153 - 2161
[6] Robustness of differential gene expression analysis of RNA-seq
Stupnikov, A.
McInerney, C. E.
Savage, K. I.
McIntosh, S. A.
Emmert-Streib, F.
Kennedy, R.
Salto-Tellez, M.
Prise, K. M.
McArt, D. G.
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2021, 19 : 3470 - 3481
[7] Differential expression analysis for paired RNA-seq data
Chung, Lisa M.
Ferguson, John P.
Zheng, Wei
Qian, Feng
Bruno, Vincent
Montgomery, Ruth R.
Zhao, Hongyu
BMC BIOINFORMATICS, 2013, 14 : 110
[8] Differential expression analysis for paired RNA-seq data
Lisa M Chung
John P Ferguson
Wei Zheng
Feng Qian
Vincent Bruno
Ruth R Montgomery
Hongyu Zhao
BMC Bioinformatics, 14
[9] Comprehensive evaluation of differential gene expression analysis methods for RNA-seq data
Franck Rapaport
Raya Khanin
Yupu Liang
Mono Pirun
Azra Krek
Paul Zumbo
Christopher E Mason
Nicholas D Socci
Doron Betel
Genome Biology, 14
[10] Measuring differential gene expression with RNA-seq: challenges and strategies for data analysis
Finotello, Francesca
Di Camillo, Barbara
BRIEFINGS IN FUNCTIONAL GENOMICS, 2015, 14 (02) : 130 - 142

← 1 2 3 4 5 →