A graph regularized dimension reduction method for out-of-sample data

被引:11
|
作者
Tang, Mengfan [1 ]
Nie, Feiping [2 ,3 ]
Jain, Ramesh [1 ]
机构
[1] Univ Calif Irvine, Dept Comp Sci, Irvine, CA 92697 USA
[2] Northwestern Polytech Univ, Sch Comp Sci, Xian, Peoples R China
[3] Northwestern Polytech Univ, Ctr OPT IMagery Anal & Learning OPTIMAL, Xian, Peoples R China
关键词
Dimension reduction; Out-of-sample data; Graph regularized PCA; Manifold learning; Clustering; RECOGNITION; EIGENMAPS;
D O I
10.1016/j.neucom.2016.11.012
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Among various dimension reduction techniques, Principal Component Analysis (PCA) is specialized in treating vector data, whereas Laplacian embedding is often employed for embedding graph data. Moreover, graph regularized PCA, a combination of both techniques, has also been developed to assist the learning of a low dimensional representation of vector data by incorporating graph data. However, these approaches are confronted by the out-of-sample problem: each time when new data is added, it has to be combined with the old data before being fed into the algorithm to re-compute the eigenvectors, leading to enormous computational cost. In order to address this problem, we extend the graph regularized PCA to the graph regularized linear regression PCA (grlrPCA). grlrPCA eliminates the redundant calculation on the old data by first learning a linear function and then directly applying it to the new data for its dimension reduction. Furthermore, we derive an efficient iterative algorithm to solve grlrPCA optimization problem and show the close relatedness of grlrPCA and unsupervised Linear Discriminant Analysis at infinite regularization parameter limit. The evaluations of multiple metrics on seven realistic datasets demonstrate that grlrPCA outperforms established unsupervised dimension reduction algorithms.
引用
收藏
页码:58 / 63
页数:6
相关论文
共 50 条
  • [1] Image classification with manifold learning for out-of-sample data
    Han, Yahong
    Xu, Zhongwen
    Ma, Zhigang
    Huang, Zi
    SIGNAL PROCESSING, 2013, 93 (08) : 2169 - 2177
  • [2] Out-of-Sample Extension for Dimensionality Reduction of Noisy Time Series
    Dadkhahi, Hamid
    Duarte, Marco F.
    Marlin, Benjamin M.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (11) : 5435 - 5446
  • [3] ISOMAP OUT-OF-SAMPLE EXTENSION FOR NOISY TIME SERIES DATA
    Dadkhahi, Hamid
    Duarte, Marco F.
    Marlin, Benjamin
    2015 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, 2015,
  • [4] Incremental Multi-manifold Out-of-Sample Data Prediction
    Liu, Zhongxin
    Wang, Wenmin
    Wang, Ronggang
    2014 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 2, 2014, : 481 - 486
  • [5] Classification of small lesions on dynamic breast MRI: Integrating dimension reduction and out-of-sample extension into CADx methodology
    Nagarajan, Mahesh B.
    Huber, Markus B.
    Schlossbauer, Thomas
    Leinsinger, Gerda
    Krol, Andrzej
    Wismueller, Axel
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2014, 60 (01) : 65 - 77
  • [6] Generalizing Time Aggregation to Out-of-Sample Data Using Minimum Bipartite Graph Matching for Power Systems Studies
    Sarajpoor, Nima
    Rakai, Logan
    Amjady, Nima
    Zareipour, Hamidreza
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2024, 39 (03) : 5352 - 5365
  • [7] A Sparse-Grid-Based Out-of-Sample Extension for Dimensionality Reduction and Clustering with Laplacian Eigenmaps
    Peherstorfer, Benjamin
    Pflueger, Dirk
    Bungartz, Hans-Joachim
    AI 2011: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2011, 7106 : 112 - 121
  • [8] Out-of-Sample Generalizations for Supervised Manifold Learning for Classification
    Vural, Elif
    Guillemot, Christine
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (03) : 1410 - 1424
  • [9] Out-of-sample extrapolation of learned manifolds
    Chin, Tat-Jun
    Suter, David
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (09) : 1547 - 1556
  • [10] Mathematical analysis on out-of-sample extensions
    Wang, Jianzhong
    INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2018, 16 (05)