covRNA: discovering covariate associations in large-scale gene expression data

被引:0
作者
Urban, Lara [1 ,2 ]
Remmele, Christian W. [1 ]
Dittrich, Marcus [1 ,3 ]
Schwarz, Roland F. [4 ]
Mueller, Tobias [1 ]
机构
[1] Univ Wurzburg, Dept Bioinformat, Bioctr, Wurzburg, Germany
[2] European Mol Biol Lab, European Bioinformat Inst, Wellcome Genome Campus, Cambridge, England
[3] Univ Wurzburg, Inst Human Genet, Wurzburg, Germany
[4] Max Delbruck Ctr, Berlin Inst Med Syst Biol, Berlin, Germany
关键词
Multivariate analysis; Fourthcorner analysis; RLQ analysis; Transcriptomics; High-throughput data; Visualization; Ordination methods; RNA-Seq analysis; Microarray analysis; 4TH-CORNER;
D O I
10.1186/s13104-020-04946-1
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
ObjectiveThe biological interpretation of gene expression measurements is a challenging task. While ordination methods are routinely used to identify clusters of samples or co-expressed genes, these methods do not take sample or gene annotations into account. We aim to provide a tool that allows users of all backgrounds to assess and visualize the intrinsic correlation structure of complex annotated gene expression data and discover the covariates that jointly affect expression patterns.ResultsThe Bioconductor package covRNA provides a convenient and fast interface for testing and visualizing complex relationships between sample and gene covariates mediated by gene expression data in an entirely unsupervised setting. The relationships between sample and gene covariates are tested by statistical permutation tests and visualized by ordination. The methods are inspired by the fourthcorner and RLQ analyses used in ecological research for the analysis of species abundance data, that we modified to make them suitable for the distributional characteristics of both, RNA-Seq read counts and microarray intensities, and to provide a high-performance parallelized implementation for the analysis of large-scale gene expression data on multi-core computational systems. CovRNA provides additional modules for unsupervised gene filtering and plotting functions to ensure a smooth and coherent analysis workflow.
引用
收藏
页数:5
相关论文
共 12 条
[1]   Singular value decomposition for genome-wide expression data processing and modeling [J].
Alter, O ;
Brown, PO ;
Botstein, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (18) :10101-10106
[2]   Exploring the transcription factor activity in high-throughput gene expression data using RLQ analysis [J].
Baty, Florent ;
Ruediger, Jochen ;
Miglino, Nicola ;
Kern, Lukas ;
Borger, Peter ;
Brutsche, Martin .
BMC BIOINFORMATICS, 2013, 14
[3]   GOstat: find statistically overrepresented Gene Ontologies within a group of genes [J].
Beissbarth, T ;
Speed, TP .
BIOINFORMATICS, 2004, 20 (09) :1464-1465
[4]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[5]   Kruppel-like Factor 4 modulates interleukin-6 release in human dendritic cells after in vitro stimulation with Aspergillus fumigatus and Candida albicans [J].
Czakai, Kristin ;
Leonhardt, Ines ;
Dix, Andreas ;
Bonin, Michael ;
Linde, Joerg ;
Einsele, Hermann ;
Kurzai, Oliver ;
Loeffler, Juergen .
SCIENTIFIC REPORTS, 2016, 6
[6]   Combining the fourth-corner and the RLQ methods for assessing trait responses to environmental variation [J].
Dray, Stephane ;
Choler, Philippe ;
Doledec, Sylvain ;
Peres-Neto, Pedro R. ;
Thuiller, Wilfried ;
Pavoine, Sandrine ;
ter Braak, Cajo J. F. .
ECOLOGY, 2014, 95 (01) :14-21
[7]   The ade4 package: Implementing the duality diagram for ecologists [J].
Dray, Stephane ;
Dufour, Anne-Beatrice .
JOURNAL OF STATISTICAL SOFTWARE, 2007, 22 (04) :1-20
[8]   Bioconductor: open software development for computational biology and bioinformatics [J].
Gentleman, RC ;
Carey, VJ ;
Bates, DM ;
Bolstad, B ;
Dettling, M ;
Dudoit, S ;
Ellis, B ;
Gautier, L ;
Ge, YC ;
Gentry, J ;
Hornik, K ;
Hothorn, T ;
Huber, W ;
Iacus, S ;
Irizarry, R ;
Leisch, F ;
Li, C ;
Maechler, M ;
Rossini, AJ ;
Sawitzki, G ;
Smith, C ;
Smyth, G ;
Tierney, L ;
Yang, JYH ;
Zhang, JH .
GENOME BIOLOGY, 2004, 5 (10)
[9]   Strand-Specific RNA-Seq Reveals Ordered Patterns of Sense and Antisense Transcription in Bacillus anthracis [J].
Passalacqua, Karla D. ;
Varadarajan, Anjana ;
Weist, Charlotte ;
Ondov, Brian D. ;
Byrd, Benjamin ;
Read, Timothy D. ;
Bergman, Nicholas H. .
PLOS ONE, 2012, 7 (08)
[10]   Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles [J].
Subramanian, A ;
Tamayo, P ;
Mootha, VK ;
Mukherjee, S ;
Ebert, BL ;
Gillette, MA ;
Paulovich, A ;
Pomeroy, SL ;
Golub, TR ;
Lander, ES ;
Mesirov, JP .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (43) :15545-15550