Real Time Principal Component Analysis

被引:0
作者
Chowdhury, Ranak Roy [1 ]
Adnan, Muhammad Abdullah [1 ]
Gupta, Rajesh K. [2 ]
机构
[1] BUET, Dhaka, Bangladesh
[2] Univ Calif San Diego, San Diego, CA 92103 USA
来源
2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019) | 2019年
关键词
Big Data; Real Time; Dimensionality Reduction; PCA;
D O I
10.1109/ICDE.2019.00171
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
By processing the data in motion, real-time data processing enables us to extract instantaneous results from online input data that ensures timely responsiveness to events as well as a much enhanced capacity to process large data sets. This is especially important when decision loops include querying and processing data on the web where size and latency considerations make it impossible to process raw data in real-time. This makes dimensionality reduction techniques, like principal component analysis (PCA), an important data preprocessing tool to gain insights into data. In this paper, we propose a variant of PCA, that is suited for real-time applications. In the real-time version of the PCA problem, we maintain a window over the most recent data and project every incoming row of data into lower dimensional subspace, which we generate as the output of the model. The goal is to minimize the reconstruction error of the output from the input. We use the reconstruction error as the termination criteria to update the eigenspace as new data arrives. To verify whether our proposed model can capture the essence of the changing distribution of large datasets in real-time, we have implemented the algorithm and evaluated performance against carefully designed simulations that change distributions of data sources over time in a controllable manner. Furthermore, we have demonstrated that our algorithm can capture the changing distributions of real-life datasets by running simulations on datasets from a variety of real-time applications e.g. localization, customer expenditure, etc. We propose algorithmic enhancements that rely upon spectral analysis to improve dimensionality reduction. Results show that our method can successfully capture the changing distribution of data in a real-time scenario, thus enabling real-time PCA.
引用
收藏
页码:1678 / 1681
页数:4
相关论文
共 50 条
  • [21] A New Basis for Sparse Principal Component Analysis
    Chen, Fan
    Rohe, Karl
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2024, 33 (02) : 421 - 434
  • [22] Anomaly Detection Based on Kernel Principal Component and Principal Component Analysis
    Wang, Wei
    Zhang, Min
    Wang, Dan
    Jiang, Yu
    Li, Yuliang
    Wu, Hongda
    COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, 2019, 463 : 2222 - 2228
  • [23] Exploration of Principal Component Analysis: Deriving Principal Component Analysis Visually Using Spectra
    Beattie, J. Renwick
    Esmonde-White, Francis W. L.
    APPLIED SPECTROSCOPY, 2021, 75 (04) : 361 - 375
  • [24] Real-time arc-welding defect detection and classification with principal component analysis and artificial neural networks
    Mirapeix, J.
    Garcia-Allende, P. B.
    Cobo, A.
    Conde, O. M.
    Lopez-Higuera, J. M.
    NDT & E INTERNATIONAL, 2007, 40 (04) : 315 - 323
  • [25] Effective dimensionality for principal component analysis of time series expression data
    Hörnquist, M
    Hertz, J
    Wahde, M
    BIOSYSTEMS, 2003, 71 (03) : 311 - 317
  • [26] Multivariate time series clustering based on common principal component analysis
    Li, Hailin
    NEUROCOMPUTING, 2019, 349 : 239 - 247
  • [27] Universum Principal Component Analysis
    Chen, Xiao-hong
    Ma, Di
    INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND MANAGEMENT ENGINEERING (ITME 2014), 2014, : 236 - 241
  • [28] Decomposable Principal Component Analysis
    Wiesel, Ami
    Hero, Alfred O.
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2009, 57 (11) : 4369 - 4377
  • [29] Geometrical Approximated Principal Component Analysis for Hyperspectral Image Analysis
    Machidon, Alina L.
    Del Frate, Fabio
    Picchiani, Matteo
    Machidon, Octavian M.
    Ogrutan, Petre L.
    REMOTE SENSING, 2020, 12 (11)
  • [30] A Generalization of Principal Component Analysis
    Battaglino, Samuele
    Koyuncu, Erdem
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 3607 - 3611