ClustVis: a web tool for visualizing clustering of multivariate data using Principal Component Analysis and heatmap

Cited by: 2667
Authors
Metsalu, Tauno [1]
Vilo, Jaak [1]
Affiliations
[1] Univ Tartu, Inst Comp Sci, EE-50409 Tartu, Estonia
Keywords
PACKAGE;
DOI
10.1093/nar/gkv468
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
Principal Component Analysis (PCA) is a widely used method for reducing the dimensionality of high-dimensional data, often followed by visualizing two of the components on a scatterplot. Although widely used, the method lacks an easy-to-use web interface that scientists with limited programming skills could use to plot their own data. The same applies to creating heatmaps: conditional formatting of Excel cells can produce colored heatmaps, but more advanced features such as clustering and experimental annotations require more sophisticated analysis tools. We present a web tool called ClustVis that aims to provide an intuitive user interface. Users can upload data from a simple delimited text file that can be created in a spreadsheet program. Data processing methods and the final appearance of the PCA and heatmap plots can be modified using drop-down menus, text boxes, sliders, etc. Appropriate defaults are provided to reduce the time users need to specify input parameters. As output, users can download the PCA plot and heatmap in their preferred file format. This web server is freely available at http://biit.cs.ut.ee/clustvis/.
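The workflow the abstract describes (read a delimited text file, preprocess, run PCA, keep two components for a scatterplot) can be illustrated with a minimal sketch. This is not ClustVis's implementation: the function names `read_matrix`, `scale_rows`, and `pca`, the `EXAMPLE` data, the layout with features in rows and samples in columns, the choice of unit variance scaling, and the use of power iteration in place of a library eigensolver are all assumptions made for illustration.

```python
import csv
import io
import math
import random

# Hypothetical tab-delimited input: header row of sample names, one feature per row.
EXAMPLE = "\n".join([
    "gene\ts1\ts2\ts3\ts4",
    "g1\t1\t2\t3\t4",
    "g2\t2\t4\t6\t8",
    "g3\t4\t1\t3\t2",
])


def read_matrix(text, delimiter="\t"):
    """Parse delimited text into sample names and a features-by-samples matrix."""
    rows = [r for r in csv.reader(io.StringIO(text), delimiter=delimiter) if r]
    samples = rows[0][1:]
    values = [[float(v) for v in r[1:]] for r in rows[1:]]
    return samples, values


def scale_rows(values):
    """Center each feature (row) and scale it to unit variance."""
    scaled = []
    for row in values:
        mean = sum(row) / len(row)
        sd = math.sqrt(sum((v - mean) ** 2 for v in row) / (len(row) - 1))
        scaled.append([(v - mean) / sd if sd > 0 else 0.0 for v in row])
    return scaled


def pca(values, n_components=2, iters=500, seed=0):
    """Return per-sample scores and the variance along each leading component.

    Expects row-centered input (e.g. the output of scale_rows).
    """
    points = [list(col) for col in zip(*values)]  # one vector per sample
    n, d = len(points), len(values)
    # Feature-by-feature covariance matrix of the centered data.
    cov = [[sum(p[i] * p[j] for p in points) / (n - 1) for j in range(d)]
           for i in range(d)]
    rng = random.Random(seed)
    axes, variances = [], []
    for _component in range(n_components):
        v = [rng.uniform(-1.0, 1.0) for _k in range(d)]
        length = math.sqrt(sum(x * x for x in v))
        v = [x / length for x in v]
        for _step in range(iters):  # power iteration toward the top eigenvector
            w = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
            norm = math.sqrt(sum(x * x for x in w))
            if norm < 1e-12:  # no variance left in any direction
                break
            v = [x / norm for x in w]
        lam = sum(v[i] * sum(cov[i][j] * v[j] for j in range(d)) for i in range(d))
        axes.append(v)
        variances.append(lam)
        # Deflate so the next pass finds the next component.
        cov = [[cov[i][j] - lam * v[i] * v[j] for j in range(d)] for i in range(d)]
    scores = [[sum(p[i] * a[i] for i in range(d)) for a in axes] for p in points]
    return scores, variances
```

Calling `pca(scale_rows(values))` on the parsed matrix yields one pair of coordinates per sample, which is the input a two-component scatterplot needs. The sign of each component is arbitrary, so plots from different tools may appear mirrored.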
Pages: W566-W570
Page count: 5