InCHlib - interactive cluster heatmap for web applications

被引:45
作者
Skuta, Ctibor [1 ,2 ]
Bartunek, Petr [2 ]
Svozil, Daniel [1 ,2 ]
机构
[1] Prague Inst Chem Technol, Fac Chem Technol, Lab Informat & Chem, CZ-16628 Prague, Czech Republic
[2] Acad Sci Czech Republ, Inst Mol Genet, CZ OPENSCREEN, Vvi, CZ-14220 Prague, Czech Republic
关键词
Data clustering; Cluster heatmap; Scientific visualization; Web integration; Client-side scripting; !text type='Java']Java[!/text]Script library; Big data; Exploration; CANCER GENOMICS BROWSER; MICROARRAY DATA; MOLECULAR DIVERSITY; SCAFFOLD DIVERSITY; NATURAL-PRODUCTS; LEAD DISCOVERY; VISUALIZATION; DRUGS; PLATFORM; TARGETS;
D O I
10.1186/s13321-014-0044-4
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Background: Hierarchical clustering is an exploratory data analysis method that reveals the groups (clusters) of similar objects. The result of the hierarchical clustering is a tree structure called dendrogram that shows the arrangement of individual clusters. To investigate the row/column hierarchical cluster structure of a data matrix, a visualization tool called 'cluster heatmap' is commonly employed. In the cluster heatmap, the data matrix is displayed as a heatmap, a 2-dimensional array in which the colour of each element corresponds to its value. The rows/columns of the matrix are ordered such that similar rows/columns are near each other. The ordering is given by the dendrogram which is displayed on the side of the heatmap. Results: We developed InCHlib (Interactive Cluster Heatmap Library), a highly interactive and lightweight JavaScript library for cluster heatmap visualization and exploration. InCHlib enables the user to select individual or clustered heatmap rows, to zoom in and out of clusters or to flexibly modify heatmap appearance. The cluster heatmap can be augmented with additional metadata displayed in a different colour scale. In addition, to further enhance the visualization, the cluster heatmap can be interconnected with external data sources or analysis tools. Data clustering and the preparation of the input file for InCHlib is facilitated by the Python utility script inchlib_clust. Conclusions: The cluster heatmap is one of the most popular visualizations of large chemical and biomedical data sets originating, e. g., in high-throughput screening, genomics or transcriptomics experiments. The presented JavaScript library InCHlib is a client-side solution for cluster heatmap exploration. InCHlib can be easily deployed into any modern web application and configured to cooperate with external tools and data sources. Though InCHlib is primarily intended for the analysis of chemical or biological data, it is a versatile tool which application domain is not limited to the life sciences only.
引用
收藏
页数:9
相关论文
共 57 条
[21]   Scaffold Distributions in Bioactive Molecules, Clinical Trials Compounds, and Drugs [J].
Hu, Ye ;
Bajorath, Juergen .
CHEMMEDCHEM, 2010, 5 (02) :187-190
[22]   Expression Profiler:: next generation-an online platform for analysis of microarray data [J].
Kapushesky, M ;
Kemmeren, P ;
Culhane, AC ;
Durinck, S ;
Ihmels, J ;
Körner, C ;
Kull, M ;
Torrente, A ;
Sarkans, U ;
Vilo, J ;
Brazma, A .
NUCLEIC ACIDS RESEARCH, 2004, 32 :W465-W470
[23]   Nuclear hormone receptors: Ligand-activated regulators of transcription and diverse cell responses [J].
Katzenellenbogen, JA ;
Katzenellenbogen, BS .
CHEMISTRY & BIOLOGY, 1996, 3 (07) :529-536
[24]  
Kim Nayoung, 2012, Genomics & Informatics, V10, P263, DOI 10.5808/GI.2012.10.4.263
[25]   Assessing the scaffold diversity of screening libraries [J].
Krier, M ;
Bret, G ;
Rognan, D .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (02) :512-524
[26]   Scaffold architecture and pharmacophoric properties of natural products and trade drugs: Application in the design of natural product-based combinatorial libraries [J].
Lee, ML ;
Schneider, G .
JOURNAL OF COMBINATORIAL CHEMISTRY, 2001, 3 (03) :284-289
[27]   StratomeX: Visual Analysis of Large-Scale Heterogeneous Genomics Data for Cancer Subtype Characterization [J].
Lex, A. ;
Streit, M. ;
Schulz, H. -J. ;
Partl, C. ;
Schmalstieg, D. ;
Park, P. J. ;
Gehlenborg, N. .
COMPUTER GRAPHICS FORUM, 2012, 31 (03) :1175-1184
[28]  
MacCuish JD, 2013, WILEY INTERDISCIP RE, V4, P34
[29]   THE NUCLEAR RECEPTOR SUPERFAMILY - THE 2ND DECADE [J].
MANGELSDORF, DJ ;
THUMMEL, C ;
BEATO, M ;
HERRLICH, P ;
SCHUTZ, G ;
UMESONO, K ;
BLUMBERG, B ;
KASTNER, P ;
MARK, M ;
CHAMBON, P ;
EVANS, RM .
CELL, 1995, 83 (06) :835-839
[30]   Babelomics: an integrative platform for the analysis of transcriptomics, proteomics and genomic data with advanced functional profiling [J].
Medina, Ignacio ;
Carbonell, Jose ;
Pulido, Luis ;
Madeira, Sara C. ;
Goetz, Stefan ;
Conesa, Ana ;
Tarraga, Joaquin ;
Pascual-Montano, Alberto ;
Nogales-Cadenas, Ruben ;
Santoyo, Javier ;
Garcia, Francisco ;
Marba, Martina ;
Montaner, David ;
Dopazo, Joaquin .
NUCLEIC ACIDS RESEARCH, 2010, 38 :W210-W213