Real Time Principal Component Analysis

被引:0
作者
Chowdhury, Ranak Roy [1 ]
Adnan, Muhammad Abdullah [1 ]
Gupta, Rajesh K. [2 ]
机构
[1] BUET, Dhaka, Bangladesh
[2] Univ Calif San Diego, San Diego, CA 92103 USA
来源
2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019) | 2019年
关键词
Big Data; Real Time; Dimensionality Reduction; PCA;
D O I
10.1109/ICDE.2019.00171
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
By processing the data in motion, real-time data processing enables us to extract instantaneous results from online input data that ensures timely responsiveness to events as well as a much enhanced capacity to process large data sets. This is especially important when decision loops include querying and processing data on the web where size and latency considerations make it impossible to process raw data in real-time. This makes dimensionality reduction techniques, like principal component analysis (PCA), an important data preprocessing tool to gain insights into data. In this paper, we propose a variant of PCA, that is suited for real-time applications. In the real-time version of the PCA problem, we maintain a window over the most recent data and project every incoming row of data into lower dimensional subspace, which we generate as the output of the model. The goal is to minimize the reconstruction error of the output from the input. We use the reconstruction error as the termination criteria to update the eigenspace as new data arrives. To verify whether our proposed model can capture the essence of the changing distribution of large datasets in real-time, we have implemented the algorithm and evaluated performance against carefully designed simulations that change distributions of data sources over time in a controllable manner. Furthermore, we have demonstrated that our algorithm can capture the changing distributions of real-life datasets by running simulations on datasets from a variety of real-time applications e.g. localization, customer expenditure, etc. We propose algorithmic enhancements that rely upon spectral analysis to improve dimensionality reduction. Results show that our method can successfully capture the changing distribution of data in a real-time scenario, thus enabling real-time PCA.
引用
收藏
页码:1678 / 1681
页数:4
相关论文
共 50 条
[31]   Merging model-based two-dimensional principal component analysis [J].
Cui, Kai ;
Gao, Quanxue ;
Zhang, Hailin ;
Gao, Xinbo ;
Xie, Deyan .
NEUROCOMPUTING, 2015, 168 :1198-1206
[32]   Real-time dynamic MR image reconstruction using compressed sensing and principal component analysis (CS-PCA): Demonstration in lung tumor tracking [J].
Dietz, Bryson ;
Yip, Eugene ;
Yun, Jihyun ;
Fallone, B. Gino ;
Wachowicz, Keith .
MEDICAL PHYSICS, 2017, 44 (08) :3978-3989
[33]   Local and global principal component analysis for process monitoring [J].
Yu, Jianbo .
JOURNAL OF PROCESS CONTROL, 2012, 22 (07) :1358-1373
[34]   Backwards Principal Component Analysis and Principal Nested Relations [J].
Damon, James ;
Marron, J. S. .
JOURNAL OF MATHEMATICAL IMAGING AND VISION, 2014, 50 (1-2) :107-114
[35]   A preliminary geometric structure simplification for Principal Component Analysis [J].
Gu, Huamao ;
Lin, Tong ;
Wang, Xun .
NEUROCOMPUTING, 2019, 336 :46-55
[36]   Principal component model of multispectral data for near real-time skin chromophore mapping [J].
Kainerstorfer, Jana M. ;
Ehler, Martin ;
Amyot, Franck ;
Hassan, Moinuddin ;
Demos, Stavros G. ;
Chernomordik, Victor ;
Hitzenberger, Christoph K. ;
Gandjbakhche, Amir H. ;
Riley, Jason D. .
JOURNAL OF BIOMEDICAL OPTICS, 2010, 15 (04)
[37]   Linear-PoseNet: A Real-Time Camera Pose Estimation System Using Linear Regression and Principal Component Analysis [J].
Elmoogy, Ahmed ;
Dong, Xiaodai ;
Lu, Tao ;
Westendorp, Robert ;
Reddy, Kishore .
2020 IEEE 92ND VEHICULAR TECHNOLOGY CONFERENCE (VTC2020-FALL), 2020,
[38]   Schrodinger principal-component analysis: On the duality between principal-component analysis and the Schrodinger equation [J].
Liu, Ziming ;
Qian, Sitian ;
Wang, Yixuan ;
Yan, Yuxuan ;
Yang, Tianyi .
PHYSICAL REVIEW E, 2021, 104 (02)
[39]   Nonlinear principal component analysis for withdrawal from the employment time guarantee fund [J].
Li, Weigang ;
de Moraes, Aipore Rodrigues ;
Lihua, Shi ;
Matsushita, Raul Yukihiro .
COMPUTATIONAL INTELLIGENCE IN ECONOMICS AND FINANCE, VOL II, 2007, :75-+
[40]   Asymptotic theory of principal component analysis for time series data with cautionary comments [J].
Zhang, Xinyu ;
Tong, Howell .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2022, 185 (02) :543-565