Epiviz: a view inside the design of an integrated visual analysis software for genomics

被引:3
作者
Chelaru, Florin [1 ,2 ]
Bravo, Hector Corrada [1 ,2 ]
机构
[1] Univ Maryland, Ctr Bioinformat & Computat Biol, College Pk, MD 20742 USA
[2] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA
来源
BMC BIOINFORMATICS | 2015年 / 16卷
基金
美国国家卫生研究院;
关键词
DIFFERENTIAL EXPRESSION ANALYSIS; RNA-SEQ EXPERIMENTS; BROWSER DATABASE; BIOCONDUCTOR; ANALYTICS; GENE;
D O I
10.1186/1471-2105-16-S11-S4
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Computational and visual data analysis for genomics has traditionally involved a combination of tools and resources, of which the most ubiquitous consist of genome browsers, focused mainly on integrative visualization of large numbers of big datasets, and computational environments, focused on data modeling of a small number of moderately sized datasets. Workflows that involve the integration and exploration of multiple heterogeneous data sources, small and large, public and user specific have been poorly addressed by these tools. In our previous work, we introduced Epiviz, which bridges the gap between the two types of tools, simplifying these workflows. Results: In this paper we expand on the design decisions behind Epiviz, and introduce a series of new advanced features that further support the type of interactive exploratory workflow we have targeted. We discuss three ways in which Epiviz advances the field of genomic data analysis: 1) it brings code to interactive visualizations at various different levels; 2) takes the first steps in the direction of collaborative data analysis by incorporating user plugins from source control providers, as well as by allowing analysis states to be shared among the scientific community; 3) combines established analysis features that have never before been available simultaneously in a genome browser. In our discussion section, we present security implications of the current design, as well as a series of limitations and future research steps. Conclusions: Since many of the design choices of Epiviz are novel in genomics data analysis, this paper serves both as a document of our own approaches with lessons learned, as well as a start point for future efforts in the same direction for the genomics community.
引用
收藏
页数:14
相关论文
共 36 条
  • [1] Ahlberg C., 1996, SIGMOD Record, V25, P25, DOI 10.1145/245882.245893
  • [2] Differential expression analysis for sequence count data
    Anders, Simon
    Huber, Wolfgang
    [J]. GENOME BIOLOGY, 2010, 11 (10):
  • [3] Minfi: a flexible and comprehensive Bioconductor package for the analysis of Infinium DNA methylation microarrays
    Aryee, Martin J.
    Jaffe, Andrew E.
    Corrada-Bravo, Hector
    Ladd-Acosta, Christine
    Feinberg, Andrew P.
    Hansen, Kasper D.
    Irizarry, Rafael A.
    [J]. BIOINFORMATICS, 2014, 30 (10) : 1363 - 1369
  • [4] D3: Data-Driven Documents
    Bostock, Michael
    Ogievetsky, Vadim
    Heer, Jeffrey
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2011, 17 (12) : 2301 - 2309
  • [5] Chelaru F, 2014, NAT METHODS, V11, P938, DOI [10.1038/NMETH.3038, 10.1038/nmeth.3038]
  • [6] Ensembl 2002: accommodating comparative genomics
    Clamp, M
    Andrews, D
    Barker, D
    Bevan, P
    Cameron, G
    Chen, Y
    Clark, L
    Cox, T
    Cuff, J
    Curwen, V
    Down, T
    Durbin, R
    Eyras, E
    Gilbert, J
    Hammond, M
    Hubbard, T
    Kasprzyk, A
    Keefe, D
    Lehvaslaiho, H
    Iyer, V
    Melsopp, C
    Mongin, E
    Pettett, R
    Potter, S
    Rust, A
    Schmidt, E
    Searle, S
    Slater, G
    Smith, J
    Spooner, W
    Stabenau, A
    Stalker, J
    Stupka, E
    Ureta-Vidal, A
    Vastrik, I
    Birney, E
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (01) : 38 - 42
  • [7] Dudoit S, 2002, STAT SINICA, V12, P111
  • [8] BioMart and Bioconductor: a powerful link between biological databases and microarray data analysis
    Durinck, S
    Moreau, Y
    Kasprzyk, A
    Davis, S
    De Moor, B
    Brazma, A
    Huber, W
    [J]. BIOINFORMATICS, 2005, 21 (16) : 3439 - 3440
  • [9] The ENCODE (ENCyclopedia of DNA elements) Project
    Feingold, EA
    Good, PJ
    Guyer, MS
    Kamholz, S
    Liefer, L
    Wetterstrand, K
    Collins, FS
    Gingeras, TR
    Kampa, D
    Sekinger, EA
    Cheng, J
    Hirsch, H
    Ghosh, S
    Zhu, Z
    Pate, S
    Piccolboni, A
    Yang, A
    Tammana, H
    Bekiranov, S
    Kapranov, P
    Harrison, R
    Church, G
    Struhl, K
    Ren, B
    Kim, TH
    Barrera, LO
    Qu, C
    Van Calcar, S
    Luna, R
    Glass, CK
    Rosenfeld, MG
    Guigo, R
    Antonarakis, SE
    Birney, E
    Brent, M
    Pachter, L
    Reymond, A
    Dermitzakis, ET
    Dewey, C
    Keefe, D
    Denoeud, F
    Lagarde, J
    Ashurst, J
    Hubbard, T
    Wesselink, JJ
    Castelo, R
    Eyras, E
    Myers, RM
    Sidow, A
    Batzoglou, S
    [J]. SCIENCE, 2004, 306 (5696) : 636 - 640
  • [10] Savant: genome browser for high-throughput sequencing data
    Fiume, Marc
    Williams, Vanessa
    Brook, Andrew
    Brudno, Michael
    [J]. BIOINFORMATICS, 2010, 26 (16) : 1938 - 1944