GENAVi: a shiny web application for gene expression normalization, analysis and visualization

被引:37
作者
Reyes, Alberto Luiz P. [1 ]
Silva, Tiago C. [1 ]
Coetzee, Simon G. [1 ]
Plummer, Jasmine T. [1 ]
Davis, Brian D. [1 ]
Chen, Stephanie [1 ]
Hazelett, Dennis J. [1 ]
Lawrenson, Kate [2 ]
Berman, Benjamin P. [1 ]
Gayther, Simon A. [1 ]
Jones, Michelle R. [1 ]
机构
[1] Cedars Sinai Med Ctr, Dept Biomed Sci, Ctr Bioinformat & Funct Genom, Los Angeles, CA 90048 USA
[2] Cedars Sinai Med Ctr, Samuel Oschin Comprehens Canc Inst, Womens Canc Program, Los Angeles, CA 90048 USA
关键词
Next generation sequencing; RNA-seq; Shiny; GUI; Differential expression; Visualization; Normalization; DIFFERENTIAL EXPRESSION; ONTOLOGY;
D O I
10.1186/s12864-019-6073-7
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background The development of next generation sequencing (NGS) methods led to a rapid rise in the generation of large genomic datasets, but the development of user-friendly tools to analyze and visualize these datasets has not developed at the same pace. This presents a two-fold challenge to biologists; the expertise to select an appropriate data analysis pipeline, and the need for bioinformatics or programming skills to apply this pipeline. The development of graphical user interface (GUI) applications hosted on web-based servers such as Shiny can make complex workflows accessible across operating systems and internet browsers to those without programming knowledge. Results We have developed GENAVi (Gene Expression Normalization Analysis and Visualization) to provide a user-friendly interface for normalization and differential expression analysis (DEA) of human or mouse feature count level RNA-Seq data. GENAVi is a GUI based tool that combines Bioconductor packages in a format for scientists without bioinformatics expertise. We provide a panel of 20 cell lines commonly used for the study of breast and ovarian cancer within GENAVi as a foundation for users to bring their own data to the application. Users can visualize expression across samples, cluster samples based on gene expression or correlation, calculate and plot the results of principal components analysis, perform DEA and gene set enrichment and produce plots for each of these analyses. To allow scalability for large datasets we have provided local install via three methods. We improve on available tools by offering a range of normalization methods and a simple to use interface that provides clear and complete session reporting and for reproducible analysis. Conclusion The development of tools using a GUI makes them practical and accessible to scientists without bioinformatics expertise, or access to a data analyst with relevant skills. While several GUI based tools are currently available for RNA-Seq analysis we improve on these existing tools. This user-friendly application provides a convenient platform for the normalization, analysis and visualization of gene expression data for scientists without bioinformatics expertise.
引用
收藏
页数:9
相关论文
共 29 条
[1]  
[Anonymous], 2017, J OPEN SOURCE SOFTW, DOI [DOI 10.21105/JOSS.00359, 10.21105/joss.00359]
[2]  
[Anonymous], DEBROWSER INTERACTIV
[3]  
[Anonymous], IDEP INTEGRATED WEB
[4]  
[Anonymous], DEGUST
[5]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[6]   GO::TermFinder - open source software for accessing Gene Ontology information and finding significantly enriched Gene Ontology terms associated with a list of genes [J].
Boyle, EI ;
Weng, SA ;
Gollub, J ;
Jin, H ;
Botstein, D ;
Cherry, JM ;
Sherlock, G .
BIOINFORMATICS, 2004, 20 (18) :3710-3715
[7]   The Gene Ontology Resource: 20 years and still GOing strong [J].
Carbon, S. ;
Douglass, E. ;
Dunn, N. ;
Good, B. ;
Harris, N. L. ;
Lewis, S. E. ;
Mungall, C. J. ;
Basu, S. ;
Chisholm, R. L. ;
Dodson, R. J. ;
Hartline, E. ;
Fey, P. ;
Thomas, P. D. ;
Albou, L. P. ;
Ebert, D. ;
Kesling, M. J. ;
Mi, H. ;
Muruganujian, A. ;
Huang, X. ;
Poudel, S. ;
Mushayahama, T. ;
Hu, J. C. ;
LaBonte, S. A. ;
Siegele, D. A. ;
Antonazzo, G. ;
Attrill, H. ;
Brown, N. H. ;
Fexova, S. ;
Garapati, P. ;
Jones, T. E. M. ;
Marygold, S. J. ;
Millburn, G. H. ;
Rey, A. J. ;
Trovisco, V. ;
dos Santos, G. ;
Emmert, D. B. ;
Falls, K. ;
Zhou, P. ;
Goodman, J. L. ;
Strelets, V. B. ;
Thurmond, J. ;
Courtot, M. ;
Osumi-Sutherland, D. ;
Parkinson, H. ;
Roncaglia, P. ;
Acencio, M. L. ;
Kuiper, M. ;
Laegreid, A. ;
Logie, C. ;
Lovering, R. C. .
NUCLEIC ACIDS RESEARCH, 2019, 47 (D1) :D330-D338
[8]   TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data [J].
Colaprico, Antonio ;
Silva, Tiago C. ;
Olsen, Catharina ;
Garofano, Luciano ;
Cava, Claudia ;
Garolini, Davide ;
Sabedot, Thais S. ;
Malta, Tathiane M. ;
Pagnotta, Stefano M. ;
Castiglioni, Isabella ;
Ceccarelli, Michele ;
Bontempi, Gianluca ;
Noushmehr, Houtan .
NUCLEIC ACIDS RESEARCH, 2016, 44 (08) :e71
[9]   A survey of best practices for RNA-seq data analysis [J].
Conesa, Ana ;
Madrigal, Pedro ;
Tarazona, Sonia ;
Gomez-Cabrero, David ;
Cervera, Alejandra ;
McPherson, Andrew ;
Szczesniak, Michal Wojciech ;
Gaffney, Daniel J. ;
Elo, Laura L. ;
Zhang, Xuegong ;
Mortazavi, Ali .
GENOME BIOLOGY, 2016, 17
[10]   iDEP: an integrated web application for differential expression and pathway analysis of RNA-Seq data [J].
Ge, Steven Xijin ;
Son, Eun Wo ;
Yao, Runan .
BMC BIOINFORMATICS, 2018, 19