Out-of-sample data visualization using bi-kernel t-SNE

被引:7
作者
Zhang, Haili [1 ,2 ,3 ]
Wang, Pu [1 ,2 ,3 ]
Gao, Xuejin [1 ,2 ,3 ]
Qi, Yongsheng [4 ]
Gao, Huihui [1 ,2 ,3 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[2] Minist Educ, Engn Res Ctr Digital Community, Beijing, Peoples R China
[3] Beijing Lab Urban Mass Transit, Beijing, Peoples R China
[4] Inner Mongolia Univ Technol, Sch Elect Power, Hohhot, Inner Mongolia, Peoples R China
基金
中国国家自然科学基金;
关键词
Data visualization; dimensionality reduction; T-SNE; out-of-sample extension; outlier projection; PRINCIPAL COMPONENT ANALYSIS; DIMENSIONALITY REDUCTION; ISOMAP;
D O I
10.1177/1473871620978209
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
T-distributed stochastic neighbor embedding (t-SNE) is an effective visualization method. However, it is non-parametric and cannot be applied to steaming data or online scenarios. Although kernel t-SNE provides an explicit projection from a high-dimensional data space to a low-dimensional feature space, some outliers are not well projected. In this paper, bi-kernel t-SNE is proposed for out-of-sample data visualization. Gaussian kernel matrices of the input and feature spaces are used to approximate the explicit projection. Then principal component analysis is applied to reduce the dimensionality of the feature kernel matrix. Thus, the difference between inliers and outliers is revealed. And any new sample can be well mapped. The performance of the proposed method for out-of-sample projection is tested on several benchmark datasets by comparing it with other state-of-the-art algorithms.
引用
收藏
页码:20 / 34
页数:15
相关论文
共 50 条
  • [31] Analysis of laser ablation spectral data using dimensionality reduction techniques: PCA, t-SNE and UMAP
    Rabasovic, M. S.
    Pavlovic, D. M.
    Sevic, D.
    CONTRIBUTIONS OF THE ASTRONOMICAL OBSERVATORY SKALNATE PLESO, 2023, 53 (03): : 51 - 57
  • [32] Phonetic Segmentation of Speech using STEP and t-SNE
    Stan, Adriana
    Valentini-Botinhao, Cassia
    Giurgiu, Mircea
    King, Simon
    2015 INTERNATIONAL CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN-COMPUTER DIALOGUE (SPED), 2015,
  • [33] A t-SNE Based Classification Approach to Compositional Microbiome Data
    Xu, Xueli
    Xie, Zhongming
    Yang, Zhenyu
    Li, Dongfang
    Xu, Ximing
    FRONTIERS IN GENETICS, 2020, 11
  • [34] Theoretical Foundations of t-SNE for Visualizing High-Dimensional Clustered Data
    Cai, T. Tony
    Ma, Rong
    JOURNAL OF MACHINE LEARNING RESEARCH, 2022, 23
  • [35] Examining Intermediate Data Reduction Algorithms for use with t-SNE
    Campbell, Aaron
    Caudle, Kyle
    Hoover, Randy C.
    PROCEEDINGS OF THE 2019 THE 3RD INTERNATIONAL CONFERENCE ON COMPUTE AND DATA ANALYSIS (ICCDA 2019), 2019, : 36 - 42
  • [36] t-SNE for Complex Multi-Manifold High-Dimensional Data
    Bian R.
    Zhang J.
    Zhou L.
    Jiang P.
    Chen B.
    Wang Y.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2021, 33 (11): : 1746 - 1754
  • [37] Kernel propagation strategy: A novel out-of-sample propagation projection for subspace learning
    Su, Shuzhi
    Ge, Hongwei
    Yuan, Yun-Hao
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2016, 36 : 69 - 79
  • [38] Improving the Accuracy of Convolutional Neural Networks by Identifying and Removing Outlier Images in Datasets Using t-SNE
    Perez, Husein
    Tah, Joseph H. M.
    MATHEMATICS, 2020, 8 (05)
  • [39] Accelerating t-SNE using Fast Fourier Transforms and the Particle-Mesh Algorithm from Physics
    Delchevalerie, Valentin
    Mayer, Alexandre
    Bibal, Adrien
    Frenay, Benoit
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [40] Incremental Multi-manifold Out-of-Sample Data Prediction
    Liu, Zhongxin
    Wang, Wenmin
    Wang, Ronggang
    2014 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 2, 2014, : 481 - 486