t-viSNE: Interactive Assessment and Interpretation of t-SNE Projections

被引:118
作者
Chatzimparmpas, Angelos [1 ]
Martins, Rafael M. [1 ]
Kerren, Andreas [1 ]
机构
[1] Linnaeus Univ, Dept Comp Sci & Media Technol, S-35195 Vaxjo, Sweden
关键词
Tools; Visualization; Data visualization; Task analysis; Correlation; Principal component analysis; Dimensionality reduction; Interpretable t-SNE; dimensionality reduction; high-dimensional data; explainable machine learning; visualization; HIGH-DIMENSIONAL DATA; VISUAL ANALYSIS; REDUCTION; QUALITY; AXES;
D O I
10.1109/TVCG.2020.2986996
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
t-Distributed Stochastic Neighbor Embedding (t-SNE) for the visualization of multidimensional data has proven to be a popular approach, with successful applications in a wide range of domains. Despite their usefulness, t-SNE projections can be hard to interpret or even misleading, which hurts the trustworthiness of the results. Understanding the details of t-SNE itself and the reasons behind specific patterns in its output may be a daunting task, especially for non-experts in dimensionality reduction. In this article, we present t-viSNE, an interactive tool for the visual exploration of t-SNE projections that enables analysts to inspect different aspects of their accuracy and meaning, such as the effects of hyper-parameters, distance and neighborhood preservation, densities and costs of specific neighborhoods, and the correlations between dimensions and visual patterns. We propose a coherent, accessible, and well-integrated collection of different views for the visualization of t-SNE projections. The applicability and usability of t-viSNE are demonstrated through hypothetical usage scenarios with real data sets. Finally, we present the results of a user study where the tool's effectiveness was evaluated. By bringing to light information that would normally be lost after running t-SNE, we hope to support analysts in using t-SNE and making its results better understandable.
引用
收藏
页码:2696 / 2714
页数:19
相关论文
共 77 条
[31]  
Johnson M., 2017, T ASSOC COMPUT LING, V5, P339, DOI [DOI 10.1162/TACL_A_00065, 10.1162/tacla00065]
[32]   Local Affine Multidimensional Projection [J].
Joia, Paulo ;
Paulovich, Fernando V. ;
Coimbra, Danilo ;
Cuminato, Jose Alberto ;
Nonato, Luis Gustavo .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2011, 17 (12) :2563-2571
[33]   Principal component analysis: a review and recent developments [J].
Jolliffe, Ian T. ;
Cadima, Jorge .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2016, 374 (2065)
[34]  
Kandogan E, 2012, IEEE CONF VIS ANAL, P73, DOI 10.1109/VAST.2012.6400487
[35]  
Kaufman L., 1987, Statistical Data Analysis Based on the L1-Norm and Related Methods. First International Conference, P405
[36]   InterAxis: Steering Scatterplot Axes via Opservation-Level Interaction [J].
Kim, Hannah ;
Choo, Jaegul ;
Park, Haesun ;
Endert, Alex .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2016, 22 (01) :131-140
[37]   Clustervision: Visual Supervision of Unsupervised Clustering [J].
Kwon, Bum Chul ;
Eysenbach, Ben ;
Verma, Janu ;
Ng, Kenney ;
de Filippi, Christopher ;
Stewart, Walter F. ;
Perer, Adam .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2018, 24 (01) :142-151
[38]   AxiSketcher: Interactive Nonlinear Axis Mapping of Visualizations through User Drawings [J].
Kwon, Bum Chul ;
Kim, Hannah ;
Wall, Emily ;
Choo, Jaegul ;
Park, Haesun ;
Endert, Alex .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2017, 23 (01) :221-230
[39]   Exploring high-dimensional data through locally enhanced projections [J].
Lai, Chufan ;
Zhao, Ying ;
Yuan, Xiaoru .
JOURNAL OF VISUAL LANGUAGES AND COMPUTING, 2018, 48 :144-156
[40]  
Lee JA, 2007, INFORM SCI STAT, P1