Quantitative and Qualitative Comparison of 2D and 3D Projection Techniques for High-Dimensional Data

被引:7
作者
Tian, Zonglin [1 ]
Zhai, Xiaorui [2 ]
van Steenpaal, Gijs [1 ]
Yu, Lingyun [3 ]
Dimara, Evanthia [1 ]
Espadoto, Mateus [2 ,4 ]
Telea, Alexandru [1 ]
机构
[1] Univ Utrecht, Dept Informat & Comp Sci, NL-3584 CS Utrecht, Netherlands
[2] Univ Groningen, Bernoulli Inst, NL-9712 CP Groningen, Netherlands
[3] Xian Jiaotong Liverpool Univ, Dept Comp, Suzhou 215123, Peoples R China
[4] Univ Sao Paulo, Inst Math & Stat, BR-01000000 Sao Paulo, Brazil
关键词
dimensionality reduction; projection quality evaluation; projection explaining; DATA VISUALIZATION; VISUAL ANALYSIS; REDUCTION; INFORMATION; SPACE; SCATTERPLOT; FRAMEWORK; FIT;
D O I
10.3390/info12060239
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Projections are well-known techniques that help the visual exploration of high-dimensional data by creating depictions thereof in a low-dimensional space. While projections that target the 2D space have been studied in detail both quantitatively and qualitatively, 3D projections are far less well understood, with authors arguing both for and against the added-value of a third visual dimension. We fill this gap by first presenting a quantitative study that compares 2D and 3D projections along a rich selection of datasets, projection techniques, and quality metrics. To refine these insights, we conduct a qualitative study that compares the preference of users in exploring high-dimensional data using 2D vs. 3D projections, both without and with visual explanations. Our quantitative and qualitative findings indicate that, in general, 3D projections bring only limited added-value atop of the one provided by their 2D counterparts. However, certain 3D projection techniques can show more structure than their 2D counterparts, and can stimulate users to further exploration. All our datasets, source code, and measurements are made public for ease of replication and extension.
引用
收藏
页数:21
相关论文
共 99 条
  • [1] Stochastic proximity embedding
    Agrafiotis, DK
    [J]. JOURNAL OF COMPUTATIONAL CHEMISTRY, 2003, 24 (10) : 1215 - 1221
  • [2] Albuquerque G., 2011, P IEEE C VIS AN SCI, P11
  • [3] [Anonymous], 2009, J MACH LEARN RES, DOI DOI 10.1080/13506280444000102
  • [4] [Anonymous], 2021, **DATA OBJECT**
  • [5] [Anonymous], 2002, Information Visualization in Data Mining and Knowledge Discovery
  • [6] [Anonymous], 2021, REUTERS DATASET
  • [7] [Anonymous], 2021, WISCONSIN BREAST CAN
  • [8] [Anonymous], 2012, VISUALIZATION LARGE, DOI [DOI 10.4230/OASICS.VLUDS.2011.135, 10.4230/OASIcs.VLUDS.2011.1352, DOI 10.4230/OASICS.VLUDS.2011.1352]
  • [9] [Anonymous], 2002, LIB LOND, DOI DOI 10.2172/15002155
  • [10] [Anonymous], 2021, EXPLANATORY VISUALIZ