t-viSNE: Interactive Assessment and Interpretation of t-SNE Projections

被引:113
作者
Chatzimparmpas, Angelos [1 ]
Martins, Rafael M. [1 ]
Kerren, Andreas [1 ]
机构
[1] Linnaeus Univ, Dept Comp Sci & Media Technol, S-35195 Vaxjo, Sweden
关键词
Tools; Visualization; Data visualization; Task analysis; Correlation; Principal component analysis; Dimensionality reduction; Interpretable t-SNE; dimensionality reduction; high-dimensional data; explainable machine learning; visualization; HIGH-DIMENSIONAL DATA; VISUAL ANALYSIS; REDUCTION; QUALITY; AXES;
D O I
10.1109/TVCG.2020.2986996
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
t-Distributed Stochastic Neighbor Embedding (t-SNE) for the visualization of multidimensional data has proven to be a popular approach, with successful applications in a wide range of domains. Despite their usefulness, t-SNE projections can be hard to interpret or even misleading, which hurts the trustworthiness of the results. Understanding the details of t-SNE itself and the reasons behind specific patterns in its output may be a daunting task, especially for non-experts in dimensionality reduction. In this article, we present t-viSNE, an interactive tool for the visual exploration of t-SNE projections that enables analysts to inspect different aspects of their accuracy and meaning, such as the effects of hyper-parameters, distance and neighborhood preservation, densities and costs of specific neighborhoods, and the correlations between dimensions and visual patterns. We propose a coherent, accessible, and well-integrated collection of different views for the visualization of t-SNE projections. The applicability and usability of t-viSNE are demonstrated through hypothetical usage scenarios with real data sets. Finally, we present the results of a user study where the tool's effectiveness was evaluated. By bringing to light information that would normally be lost after running t-SNE, we hope to support analysts in using t-SNE and making its results better understandable.
引用
收藏
页码:2696 / 2714
页数:19
相关论文
共 77 条
[1]   ClustMe: A Visual Quality Measure for Ranking Monochrome Scatterplots based on Cluster Patterns [J].
Abbas, Mostafa M. ;
Aupetit, Michael ;
Sedlmair, Michael ;
Bensmail, Halima .
COMPUTER GRAPHICS FORUM, 2019, 38 (03) :225-236
[2]   viSNE enables visualization of high dimensional single-cell data and reveals phenotypic heterogeneity of leukemia [J].
Amir, El-ad David ;
Davis, Kara L. ;
Tadmor, Michelle D. ;
Simonds, Erin F. ;
Levine, Jacob H. ;
Bendall, Sean C. ;
Shenfeld, Daniel K. ;
Krishnaswamy, Smita ;
Nolan, Garry P. ;
Pe'er, Dana .
NATURE BIOTECHNOLOGY, 2013, 31 (06) :545-+
[3]   Similarity clustering of dimensions for an enhanced visualization of multidimensional data [J].
Ankerst, M ;
Berchtold, S ;
Keim, DA .
IEEE SYMPOSIUM ON INFORMATION VISUALIZATION - PROCEEDINGS, 1998, :52-+
[4]  
[Anonymous], 2020, PROJLIB A PYTHON LIB
[5]  
[Anonymous], 2012, P SIGCHI C HUMAN FAC, DOI [DOI 10.1145/2207676.2207741, 10.1145/2207676.2207741]
[6]  
[Anonymous], 2018, P ESANN
[7]   Visualizing distortions and recovering topology in continuous projection techniques [J].
Aupetit, Michael .
NEUROCOMPUTING, 2007, 70 (7-9) :1304-1330
[8]  
Borg I., 2005, MODERN MULTIDIMENSIO, DOI DOI 10.1007/0-387-28981-X
[9]  
Carpendale S, 2008, LECT NOTES COMPUT SC, V4950, P19, DOI 10.1007/978-3-540-70956-5_2
[10]   A Visual Interaction Framework for Dimensionality Reduction Based Data Exploration [J].
Cavallo, Marco ;
Demiralp, Cagatay .
PROCEEDINGS OF THE 2018 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI 2018), 2018,