Out-of-sample data visualization using bi-kernel t-SNE

被引:7
作者
Zhang, Haili [1 ,2 ,3 ]
Wang, Pu [1 ,2 ,3 ]
Gao, Xuejin [1 ,2 ,3 ]
Qi, Yongsheng [4 ]
Gao, Huihui [1 ,2 ,3 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[2] Minist Educ, Engn Res Ctr Digital Community, Beijing, Peoples R China
[3] Beijing Lab Urban Mass Transit, Beijing, Peoples R China
[4] Inner Mongolia Univ Technol, Sch Elect Power, Hohhot, Inner Mongolia, Peoples R China
基金
中国国家自然科学基金;
关键词
Data visualization; dimensionality reduction; T-SNE; out-of-sample extension; outlier projection; PRINCIPAL COMPONENT ANALYSIS; DIMENSIONALITY REDUCTION; ISOMAP;
D O I
10.1177/1473871620978209
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
T-distributed stochastic neighbor embedding (t-SNE) is an effective visualization method. However, it is non-parametric and cannot be applied to steaming data or online scenarios. Although kernel t-SNE provides an explicit projection from a high-dimensional data space to a low-dimensional feature space, some outliers are not well projected. In this paper, bi-kernel t-SNE is proposed for out-of-sample data visualization. Gaussian kernel matrices of the input and feature spaces are used to approximate the explicit projection. Then principal component analysis is applied to reduce the dimensionality of the feature kernel matrix. Thus, the difference between inliers and outliers is revealed. And any new sample can be well mapped. The performance of the proposed method for out-of-sample projection is tested on several benchmark datasets by comparing it with other state-of-the-art algorithms.
引用
收藏
页码:20 / 34
页数:15
相关论文
共 50 条
  • [41] Incremental Multi-manifold Out-of-Sample Data Prediction
    Liu, Zhongxin
    Wang, Wenmin
    Wang, Ronggang
    2014 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 2, 2014, : 481 - 486
  • [42] Accelerating t-SNE using Tree-Based Algorithms
    van der Maaten, Laurens
    JOURNAL OF MACHINE LEARNING RESEARCH, 2014, 15 : 3221 - 3245
  • [43] Wind Farm NWP Data Preprocessing Method Based on t-SNE
    Gu, Jiu
    Wang, Yining
    Xie, Da
    Zhang, Yu
    ENERGIES, 2019, 12 (19)
  • [44] Speaker Recognition System Based on Identity Vector Using t-SNE Visualization and Mean-shift Algorithm
    Kiani, Kourosh
    Baniasadi, Atefeh
    2019 5TH IRANIAN CONFERENCE ON SIGNAL PROCESSING AND INTELLIGENT SYSTEMS (ICSPIS 2019), 2019,
  • [45] Classification of Categorical Data Based on the Chi-Square Dissimilarity and t-SNE
    Cardona, Luis Ariosto Serna
    Vargas-Cardona, Hernan Dario
    Navarro Gonzalez, Piedad
    Cardenas Pena, David Augusto
    Orozco Gutierrez, Alvaro Angel
    COMPUTATION, 2020, 8 (04) : 1 - 15
  • [46] Understanding How Dimension Reduction Tools Work: An Empirical Approach to Deciphering t-SNE, UMAP, TriMap, and PaCMAP for Data Visualization
    Wang, Yingfan
    Huang, Haiyang
    Rudin, Cynthia
    Shaposhnik, Yaron
    JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22
  • [47] Dimensionality reduction and sensitivity improvement for TACTIC Cherenkov data using t-SNE machine learning algorithm
    Das, M. P.
    Dhar, V. K.
    Verma, S.
    Yadav, K. K.
    NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH SECTION A-ACCELERATORS SPECTROMETERS DETECTORS AND ASSOCIATED EQUIPMENT, 2023, 1057
  • [48] Separability of Histogram Based Features for Optical Performance Monitoring: An Investigation Using t-SNE Technique
    Saif, Waddah S.
    Alshawi, Tariq
    Esmail, Maged Abdullah
    Ragheb, Amr
    Alshebeili, Saleh
    IEEE PHOTONICS JOURNAL, 2019, 11 (03):
  • [49] SLOWLY MOVING TARGET DETECTION USING T-SNE AND SUPPORT VECTOR MACHINE
    Fang, Dan
    Su, Jia
    Li, Tao
    Fan, Yifei
    Tao, Mingliang
    Liang, Jiawang
    Shi, Jiao
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 883 - 886
  • [50] Improved t-SNE based manifold dimensional reduction for remote sensing data processing
    Song, Weijing
    Wang, Lizhe
    Liu, Peng
    Choo, Kim-Kwang Raymond
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (04) : 4311 - 4326