LDSScanner: Exploratory Analysis of Low-Dimensional Structures in High-Dimensional Datasets

被引:59
|
作者
Xia, Jiazhi [1 ]
Ye, Fenjin [1 ]
Chen, Wei [2 ]
Wang, Yusi [1 ]
Chen, Weifeng [3 ]
Ma, Yuxin [2 ]
Tung, Anthony K. H. [4 ]
机构
[1] Cent South Univ, Changsha, Hunan, Peoples R China
[2] Zhejiang Univ, Hangzhou, Zhejiang, Peoples R China
[3] Zhejiang Univ Finance & Econ, Hangzhou, Zhejiang, Peoples R China
[4] Natl Univ Singapore, Singapore, Singapore
基金
国家自然科学基金重大项目; 美国国家科学基金会;
关键词
High-dimensional data; low-dimensional structure; subspace; manifold; visual exploration; VISUAL EXPLORATION; VISUALIZATION; REDUCTION; METRICS;
D O I
10.1109/TVCG.2017.2744098
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Many approaches for analyzing a high-dimensional dataset assume that the dataset contains specific structures, e.g., clusters in linear subspaces or non-linear manifolds. This yields a trial-and-error process to verify the appropriate model and parameters. This paper contributes an exploratory interface that supports visual identification of low-dimensional structures in a high-dimensional dataset, and facilitates the optimized selection of data models and configurations. Our key idea is to abstract a set of global and local feature descriptors from the neighborhood graph-based representation of the latent low-dimensional structure, such as pairwise geodesic distance (GD) among points and pairwise local tangent space divergence (LTSD) among pointwise local tangent spaces (LTS). We propose a new LTSD-GD view, which is constructed by mapping LTSD and GD to the x axis and y axis using 1D multidimensional scaling, respectively. Unlike traditional dimensionality reduction methods that preserve various kinds of distances among points, the LTSD-GD view presents the distribution of pointwise LTS (x axis) and the variation of LTS in structures (the combination of x axis and y axis). We design and implement a suite of visual tools for navigating and reasoning about intrinsic structures of a high-dimensional dataset. Three case studies verify the effectiveness of our approach.
引用
收藏
页码:236 / 245
页数:10
相关论文
共 50 条
  • [31] Real-Time Event Identification Through Low-Dimensional Subspace Characterization of High-Dimensional Synchrophasor Data
    Li, Wenting
    Wang, Meng
    Chow, Joe H.
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2018, 33 (05) : 4937 - 4947
  • [32] Converting high-dimensional regression to high-dimensional conditional density estimation
    Izbicki, Rafael
    Lee, Ann B.
    ELECTRONIC JOURNAL OF STATISTICS, 2017, 11 (02): : 2800 - 2831
  • [33] High-dimensional discriminant analysis
    Bouveyron, Charles
    Girard, Stephane
    Schmid, Cordelia
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2007, 36 (13-16) : 2607 - 2623
  • [34] Joint t-SNE for Comparable Projections of Multiple High-Dimensional Datasets
    Wang, Yinqiao
    Chen, Lu
    Jo, Jaemin
    Wang, Yunhai
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2022, 28 (01) : 623 - 632
  • [35] Well-Separated Pair Decompositions for High-Dimensional Datasets
    Matijevic, Domagoj
    ALGORITHMS, 2023, 16 (05)
  • [36] Joint Maximum Likelihood Estimation for High-Dimensional Exploratory Item Factor Analysis
    Chen, Yunxiao
    Li, Xiaoou
    Zhang, Siliang
    PSYCHOMETRIKA, 2019, 84 (01) : 124 - 146
  • [37] Joint Maximum Likelihood Estimation for High-Dimensional Exploratory Item Factor Analysis
    Yunxiao Chen
    Xiaoou Li
    Siliang Zhang
    Psychometrika, 2019, 84 : 124 - 146
  • [38] Low-dimensional tracking of association structures in categorical data
    D'Enza, Alfonso Iodice
    Markos, Angelos
    STATISTICS AND COMPUTING, 2015, 25 (05) : 1009 - 1022
  • [39] Photoreflectance spectroscopy of low-dimensional GaAs/AlGaAs structures
    Sek, G
    Misiewicz, J
    Cheng, TS
    ADVANCED MATERIALS FOR OPTICS AND ELECTRONICS, 1997, 7 (05): : 241 - 247
  • [40] Low-dimensional tracking of association structures in categorical data
    Alfonso Iodice D’Enza
    Angelos Markos
    Statistics and Computing, 2015, 25 : 1009 - 1022