LDSScanner: Exploratory Analysis of Low-Dimensional Structures in High-Dimensional Datasets

被引:59
|
作者
Xia, Jiazhi [1 ]
Ye, Fenjin [1 ]
Chen, Wei [2 ]
Wang, Yusi [1 ]
Chen, Weifeng [3 ]
Ma, Yuxin [2 ]
Tung, Anthony K. H. [4 ]
机构
[1] Cent South Univ, Changsha, Hunan, Peoples R China
[2] Zhejiang Univ, Hangzhou, Zhejiang, Peoples R China
[3] Zhejiang Univ Finance & Econ, Hangzhou, Zhejiang, Peoples R China
[4] Natl Univ Singapore, Singapore, Singapore
基金
国家自然科学基金重大项目; 美国国家科学基金会;
关键词
High-dimensional data; low-dimensional structure; subspace; manifold; visual exploration; VISUAL EXPLORATION; VISUALIZATION; REDUCTION; METRICS;
D O I
10.1109/TVCG.2017.2744098
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Many approaches for analyzing a high-dimensional dataset assume that the dataset contains specific structures, e.g., clusters in linear subspaces or non-linear manifolds. This yields a trial-and-error process to verify the appropriate model and parameters. This paper contributes an exploratory interface that supports visual identification of low-dimensional structures in a high-dimensional dataset, and facilitates the optimized selection of data models and configurations. Our key idea is to abstract a set of global and local feature descriptors from the neighborhood graph-based representation of the latent low-dimensional structure, such as pairwise geodesic distance (GD) among points and pairwise local tangent space divergence (LTSD) among pointwise local tangent spaces (LTS). We propose a new LTSD-GD view, which is constructed by mapping LTSD and GD to the x axis and y axis using 1D multidimensional scaling, respectively. Unlike traditional dimensionality reduction methods that preserve various kinds of distances among points, the LTSD-GD view presents the distribution of pointwise LTS (x axis) and the variation of LTS in structures (the combination of x axis and y axis). We design and implement a suite of visual tools for navigating and reasoning about intrinsic structures of a high-dimensional dataset. Three case studies verify the effectiveness of our approach.
引用
收藏
页码:236 / 245
页数:10
相关论文
共 50 条
  • [1] Missing Data Recovery for High-Dimensional Signals With Nonlinear Low-Dimensional Structures
    Gao, Pengzhi
    Wang, Meng
    Chow, Joe H.
    Berger, Matthew
    Seversky, Lee M.
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2017, 65 (20) : 5421 - 5436
  • [2] High-dimensional data visualization by interactive construction of low-dimensional parallel coordinate plots
    Itoh, Takayuki
    Kumar, Ashnil
    Klein, Karsten
    Kim, Jinman
    JOURNAL OF VISUAL LANGUAGES AND COMPUTING, 2017, 43 : 1 - 13
  • [3] Optimizations on unknown low-dimensional structures given by high-dimensional data
    Qili Chen
    Jiuhe Wang
    Qiao Junfei
    Ming Yi Zou
    Soft Computing, 2021, 25 : 12717 - 12723
  • [4] Optimizations on unknown low-dimensional structures given by high-dimensional data
    Chen, Qili
    Wang, Jiuhe
    Qiao Junfei
    Zou, Ming Yi
    SOFT COMPUTING, 2021, 25 (20) : 12717 - 12723
  • [5] LEARNING LOW-DIMENSIONAL NONLINEAR STRUCTURES FROM HIGH-DIMENSIONAL NOISY DATA: AN INTEGRAL OPERATOR APPROACH
    Ding, Xiucai
    Ma, Rong
    ANNALS OF STATISTICS, 2023, 51 (04) : 1744 - 1769
  • [6] Low-Dimensional Approximations of High-Dimensional Asset Price Models
    Redmann, Martin
    Bayer, Christian
    Goyal, Pawan
    SIAM JOURNAL ON FINANCIAL MATHEMATICS, 2021, 12 (01): : 1 - 28
  • [7] A High-Dimensional Test for Multivariate Analysis of Variance Under a Low-Dimensional Factor Structure
    Cao, Mingxiang
    Zhao, Yanling
    Xu, Kai
    He, Daojiang
    Huang, Xudong
    COMMUNICATIONS IN MATHEMATICS AND STATISTICS, 2022, 10 (04) : 581 - 597
  • [8] A High-Dimensional Test for Multivariate Analysis of Variance Under a Low-Dimensional Factor Structure
    Mingxiang Cao
    Yanling Zhao
    Kai Xu
    Daojiang He
    Xudong Huang
    Communications in Mathematics and Statistics, 2022, 10 : 581 - 597
  • [9] ON THE CONDITIONAL DISTRIBUTIONS OF LOW-DIMENSIONAL PROJECTIONS FROM HIGH-DIMENSIONAL DATA
    Leeb, Hannes
    ANNALS OF STATISTICS, 2013, 41 (02) : 464 - 483
  • [10] High-dimensional variable selection via low-dimensional adaptive learning
    Staerk, Christian
    Kateri, Maria
    Ntzoufras, Ioannis
    ELECTRONIC JOURNAL OF STATISTICS, 2021, 15 (01): : 830 - 879