Data fusion using factor analysis and low-rank matrix completion

被引:0
作者
Ahfock, Daniel [1 ]
Pyne, Saumyadipta [2 ,3 ,4 ]
McLachlan, Geoffrey J. [1 ]
机构
[1] Univ Queensland, Sch Math & Phys, Brisbane, Qld, Australia
[2] Univ Pittsburgh, Grad Sch Publ Hlth, Publ Hlth Dynam Lab, Pittsburgh, PA USA
[3] Univ Pittsburgh, Grad Sch Publ Hlth, Dept Biostat, Pittsburgh, PA 15261 USA
[4] Hlth Analyt Network, Pittsburgh, PA USA
基金
澳大利亚研究理事会;
关键词
Data fusion; Statistical file-matching; Low-rank matrix completion; Factor analysis; ALGORITHMS; NUMBER;
D O I
10.1007/s11222-021-10033-7
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Data fusion involves the integration of multiple related datasets. The statistical file-matching problem is a canonical data fusion problem in multivariate analysis, where the objective is to characterise the joint distribution of a set of variables when only strict subsets of marginal distributions have been observed. Estimation of the covariance matrix of the full set of variables is challenging given the missing-data pattern. Factor analysis models use lower-dimensional latent variables in the data-generating process, and this introduces low-rank components in the complete-data matrix and the population covariance matrix. The low-rank structure of the factor analysis model can be exploited to estimate the full covariance matrix from incomplete data via low-rank matrix completion. We prove the identifiability of the factor analysis model in the statistical file-matching problem under conditions on the number of factors and the number of shared variables over the observed marginal subsets. Additionally, we provide an EM algorithm for parameter estimation. On several real datasets, the factor model gives smaller reconstruction errors in file-matching problems than the common approaches for low-rank matrix completion.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Robust Video Super-resolution Using Low-rank Matrix Completion
    Liu, Chenyu
    Zhang, Xianlin
    Liu, Yang
    Li, Xueming
    PROCEEDINGS OF 2017 INTERNATIONAL CONFERENCE ON VIDEO AND IMAGE PROCESSING (ICVIP 2017), 2017, : 181 - 185
  • [22] Low-rank matrix completion using nuclear norm minimization and facial reduction
    Huang, Shimeng
    Wolkowicz, Henry
    JOURNAL OF GLOBAL OPTIMIZATION, 2018, 72 (01) : 5 - 26
  • [23] Video Deraining and Desnowing Using Temporal Correlation and Low-Rank Matrix Completion
    Kim, Jin-Hwan
    Sim, Jae-Young
    Kim, Chang-Su
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (09) : 2658 - 2670
  • [24] Low-Rank Hankel Matrix Completion for Robust Time-Frequency Analysis
    Zhang, Shuimei
    Zhang, Yimin D.
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2020, 68 (68) : 6171 - 6186
  • [25] Low-rank matrix completion using nuclear norm minimization and facial reduction
    Shimeng Huang
    Henry Wolkowicz
    Journal of Global Optimization, 2018, 72 : 5 - 26
  • [26] Decomposition Approach for Low-Rank Matrix Completion and Its Applications
    Ma, Rick
    Barzigar, Nafise
    Roozgard, Aminmohammad
    Cheng, Samuel
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2014, 62 (07) : 1671 - 1683
  • [27] LOW-RANK MATRIX COMPLETION BY VARIATIONAL SPARSE BAYESIAN LEARNING
    Babacan, S. Derin
    Luessi, Martin
    Molina, Rafael
    Katsaggelos, Aggelos K.
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 2188 - 2191
  • [28] Nonlinear Low-Rank Matrix Completion for Human Motion Recovery
    Xia, Guiyu
    Sun, Huaijiang
    Chen, Beijia
    Liu, Qingshan
    Feng, Lei
    Zhang, Guoqing
    Hang, Renlong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (06) : 3011 - 3024
  • [29] Low-Rank Matrix Completion Theory via Plucker Coordinates
    Tsakiris, Manolis C.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (08) : 10084 - 10099
  • [30] Localization of IoT Networks via Low-Rank Matrix Completion
    Luong Trung Nguyen
    Kim, Junhan
    Kim, Sangtae
    Shim, Byonghyo
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2019, 67 (08) : 5833 - 5847