Detecting the large entries of a sparse covariance matrix in sub-quadratic time

被引:1
|
作者
Shwartz, Ofer [1 ]
Nadler, Boaz [1 ]
机构
[1] Weizmann Inst Sci, Dept Comp Sci, Rehovot, Israel
关键词
sparse covariance matrix; sub-quadratic time complexity; multi-scale group testing;
D O I
10.1093/imaiai/iaw004
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
The covariance matrix of a p-dimensional random variable is a fundamental quantity in data analysis. Given n i.i.d. observations, it is typically estimated by the sample covariance matrix, at a computational cost of O(np(2)) operations. When n, p are large, this computation may be prohibitively slow. Moreover, in several contemporary applications, the population matrix is approximately sparse, and only its few large entries are of interest. This raises the following question: Assuming approximate sparsity of the covariance matrix, can its large entries be detected much faster, say in sub-quadratic time, without explicitly computing all its p(2) entries? In this paper, we present and theoretically analyze two randomized algorithms that detect the large entries of an approximately sparse sample covariance matrix using only O(np poly log p) operations. Furthermore, assuming sparsity of the population matrix, we derive sufficient conditions on the underlying random variable and on the number of samples n, for the sample covariance matrix to satisfy our approximate sparsity requirements. Finally, we illustrate the performance of our algorithms via several simulations.
引用
收藏
页码:304 / 330
页数:27
相关论文
共 30 条
  • [21] Finding off-diagonal entries of the inverse of a large symmetric sparse matrix
    Eastwood, Shawn
    Wan, Justin W. L.
    NUMERICAL LINEAR ALGEBRA WITH APPLICATIONS, 2013, 20 (01) : 74 - 92
  • [22] A Sub-quadratic Time and Space Complexity Solution for the Dated Tree Reconciliation Problem for Select Tree Topologies
    Drinkwater, Benjamin
    Charleston, Michael A.
    ALGORITHMS IN BIOINFORMATICS (WABI 2015), 2015, 9289 : 93 - 107
  • [23] Existence, uniqueness and comparison theorem on unbounded solutions of general time-interval BSDEs with sub-quadratic generators
    Gu, Chuang
    Wang, Yan
    Fan, Shengjun
    PROBABILITY UNCERTAINTY AND QUANTITATIVE RISK, 2025, 10 (01): : 31 - 58
  • [24] Existence, uniqueness and comparison theorem on unbounded solutions of general time-interval BSDEs with sub-quadratic generators
    Gu, Chuang
    Wang, Yan
    Fan, Shengjun
    PROBABILITY UNCERTAINTY AND QUANTITATIVE RISK, 2025,
  • [25] Exact state and covariance sub-matrix recovery for submap based sparse EIF SLAM algorithm
    Huang, Shoudong
    Wang, Zhan
    Dissanayake, Gamini
    2008 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-9, 2008, : 1868 - 1873
  • [26] A suggestion for constructing a large time-varying conditional covariance matrix
    Gibson, Heather D.
    Hall, Stephen G.
    Tavlas, George S.
    ECONOMICS LETTERS, 2017, 156 : 110 - 113
  • [27] ADMM Algorithmic Regularization Paths for Sparse and Large Scale Positive-Definite Covariance Matrix Estimation
    XIA Lin
    WANG Guanpeng
    HUANG Xudong
    WuhanUniversityJournalofNaturalSciences, 2022, 27 (02) : 128 - 134
  • [28] Large-Scale Sparse Inverse Covariance Estimation via Thresholding and Max-Det Matrix Completion
    Zhang, Richard Y.
    Fattahi, Salar
    Sojoudi, Somayeh
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [29] Covariance matrix interpolations of random vector X from AX = B with large and sparse A and stationary random B
    Sirisawang, Nutpol
    Charnsethikul, Peerayuth
    Computational Methods, Pts 1 and 2, 2006, : 475 - 480
  • [30] Time-memory trade-offs using sparse matrix methods for large-scale eigenvalue problems
    Teranishi, K
    Raghavan, P
    Yang, C
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2003, PT 1, PROCEEDINGS, 2003, 2667 : 840 - 847