Revisiting Dimensionality Reduction Techniques for Visual Cluster Analysis: An Empirical Study

被引:36
|
作者
Xia, Jiazhi [1 ]
Zhang, Yuchen [1 ]
Song, Jie [1 ]
Chen, Yang [2 ]
Wang, Yunhai [3 ]
Liu, Shixia [4 ]
机构
[1] Cent South Univ, Sch Comp Sci & Engn, Changsha, Peoples R China
[2] 14 Data, Shanghai, Peoples R China
[3] Shandong Univ, Sch Comp Sci & Technol, Jinan, Peoples R China
[4] Tsinghua Univ, Sch Software, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Visualization; Task analysis; Principal component analysis; Measurement; Manifolds; Linearity; Visual perception; Dimensionality reduction; visual cluster analysis; perception-based evaluation; T-SNE; PROJECTION; QUALITY;
D O I
10.1109/TVCG.2021.3114694
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Dimensionality Reduction (DR) techniques can generate 2D projections and enable visual exploration of cluster structures of high-dimensional datasets. However, different DR techniques would yield various patterns, which significantly affect the performance of visual cluster analysis tasks. We present the results of a user study that investigates the influence of different DR techniques on visual cluster analysis. Our study focuses on the most concerned property types, namely the linearity and locality, and evaluates twelve representative DR techniques that cover the concerned properties. Four controlled experiments were conducted to evaluate how the DR techniques facilitate the tasks of 1) cluster identification, 2) membership identification, 3) distance comparison, and 4) density comparison, respectively. We also evaluated users' subjective preference of the DR techniques regarding the quality of projected clusters. The results show that: 1) Non-linear and Local techniques are preferred in cluster identification and membership identification; 2) Linear techniques perform better than non-linear techniques in density comparison; 3) UMAP (Uniform Manifold Approximation and Projection) and t-SNE (t-Distributed Stochastic Neighbor Embedding) perform the best in cluster identification and membership identification; 4) NMF (Nonnegative Matrix Factorization) has competitive performance in distance comparison; 5) t-SNLE (t-Distributed Stochastic Neighbor Linear Embedding) has competitive performance in density comparison.
引用
收藏
页码:529 / 539
页数:11
相关论文
共 50 条
  • [41] Dimensionality Reduction of Synchrophasor Data for Early Event Detection: Linearized Analysis
    Xie, Le
    Chen, Yang
    Kumar, P. R.
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2014, 29 (06) : 2784 - 2794
  • [42] Exploring Dimensionality Reduction Techniques in Multilingual Transformers
    Álvaro Huertas-García
    Alejandro Martín
    Javier Huertas-Tato
    David Camacho
    Cognitive Computation, 2023, 15 : 590 - 612
  • [43] A Comparison of Dimensionality Reduction Techniques for Hyperspectral Imagery
    Race, Benjamin
    Wittman, Todd
    ALGORITHMS, TECHNOLOGIES, AND APPLICATIONS FOR MULTISPECTRAL AND HYPERSPECTRAL IMAGING XXVIII, 2022, 12094
  • [44] Dimensionality reduction techniques in structural and earthquake engineering
    Hajibabaee, P.
    Pourkamali-Anaraki, F.
    Hariri-Ardebili, M. A.
    ENGINEERING STRUCTURES, 2023, 278
  • [45] Exploring Dimensionality Reduction Techniques in Multilingual Transformers
    Huertas-Garcia, Alvaro
    Martin, Alejandro
    Huertas-Tato, Javier
    Camacho, David
    COGNITIVE COMPUTATION, 2023, 15 (02) : 590 - 612
  • [46] A Review of Dimensionality Reduction Techniques for Efficient Computation
    Velliangiri, S.
    Alagumuthukrishnan, S.
    Joseph, S. Iwin Thankumar
    2ND INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ADVANCED COMPUTING ICRTAC -DISRUP - TIV INNOVATION , 2019, 2019, 165 : 104 - 111
  • [47] Multilinear Spatial Discriminant Analysis for Dimensionality Reduction
    Yuan, Sen
    Mao, Xia
    Chen, Lijiang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (06) : 2669 - 2681
  • [48] Divergent Projection Analysis for Unsupervised Dimensionality Reduction
    Wang, Shanshan
    Bai, Lan
    Chen, Xu
    Wang, Zhen
    Shao, Yuan-Hai
    8TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT (ITQM 2020 & 2021): DEVELOPING GLOBAL DIGITAL ECONOMY AFTER COVID-19, 2022, 199 : 384 - 391
  • [49] Dimensionality reduction techniques for iot based data
    Tomar D.
    Tomar P.
    Recent Advances in Computer Science and Communications, 2021, 14 (03) : 724 - 735
  • [50] Exploring Dimensionality Reduction Techniques for Efficient Surrogate-Assisted Optimization
    Ullah, Sibghat
    Duc Anh Nguyen
    Wang, Hao
    Menzel, Stefan
    Sendhoff, Bernhard
    Baeck, Thomas
    2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 2965 - 2974