PCA-KL: a parametric dimensionality reduction approach for unsupervised metric learning

被引:18
作者
Levada, Alexandre L. M. [1 ]
机构
[1] Univ Fed Sao Carlos, Comp Dept, Sao Carlos, Brazil
关键词
Dimensionality reduction; PCA; KL-divergence; Unsupervised Metric learning; ALGORITHMS;
D O I
10.1007/s11634-020-00434-3
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Dimensionality reduction algorithms are powerful mathematical tools for data analysis and visualization. In many pattern recognition applications, a feature extraction step is often required to mitigate the curse of the dimensionality, a collection of negative effects caused by an arbitrary increase in the number of features in classification tasks. Principal Component Analysis (PCA) is a classical statistical method that creates new features based on linear combinations of the original ones through the eigenvectors of the covariance matrix. In this paper, we propose PCA-KL, a parametric dimensionality reduction algorithm for unsupervised metric learning, based on the computation of the entropic covariance matrix, a surrogate for the covariance matrix of the data obtained in terms of the relative entropy between local Gaussian distributions instead of the usual Euclidean distance between the data points. Numerical experiments with several real datasets show that the proposed method is capable of producing better defined clusters and also higher classification accuracy in comparison to regular PCA and several manifold learning algorithms, making PCA-KL a promising alternative for unsupervised metric learning.
引用
收藏
页码:829 / 868
页数:40
相关论文
共 52 条
[1]  
[Anonymous], 2002, Tech. Rep. PH -2002-01
[2]  
[Anonymous], INTRO STAT PATTERN R
[3]   Laplacian eigenmaps for dimensionality reduction and data representation [J].
Belkin, M ;
Niyogi, P .
NEURAL COMPUTATION, 2003, 15 (06) :1373-1396
[4]  
Bellet A., 2013, SURVEY METRIC LEARNI
[5]  
Bellman R. E., 1961, Adaptive Control Processes: A Guided Tour
[6]  
Borg I., 2005, MODERN MULTIDIMENSIO, DOI DOI 10.1007/0-387-28981-X
[7]  
CarreiraPerpinn M. A., 1997, CS9609 U SHEFF, V9, P1
[8]   Searching in metric spaces [J].
Chávez, E ;
Navarro, G ;
BaezaYates, R ;
Marroquín, JL .
ACM COMPUTING SURVEYS, 2001, 33 (03) :273-321
[9]  
Cook J., 2007, Artificial Intelligence and Statistics, P67
[10]  
Cormen TH., 2009, Introduction to Algorithms, V3