Detecting the large entries of a sparse covariance matrix in sub-quadratic time

被引:1
|
作者
Shwartz, Ofer [1 ]
Nadler, Boaz [1 ]
机构
[1] Weizmann Inst Sci, Dept Comp Sci, Rehovot, Israel
关键词
sparse covariance matrix; sub-quadratic time complexity; multi-scale group testing;
D O I
10.1093/imaiai/iaw004
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
The covariance matrix of a p-dimensional random variable is a fundamental quantity in data analysis. Given n i.i.d. observations, it is typically estimated by the sample covariance matrix, at a computational cost of O(np(2)) operations. When n, p are large, this computation may be prohibitively slow. Moreover, in several contemporary applications, the population matrix is approximately sparse, and only its few large entries are of interest. This raises the following question: Assuming approximate sparsity of the covariance matrix, can its large entries be detected much faster, say in sub-quadratic time, without explicitly computing all its p(2) entries? In this paper, we present and theoretically analyze two randomized algorithms that detect the large entries of an approximately sparse sample covariance matrix using only O(np poly log p) operations. Furthermore, assuming sparsity of the population matrix, we derive sufficient conditions on the underlying random variable and on the number of samples n, for the sample covariance matrix to satisfy our approximate sparsity requirements. Finally, we illustrate the performance of our algorithms via several simulations.
引用
收藏
页码:304 / 330
页数:27
相关论文
共 30 条
  • [1] Interaction Hard Thresholding: Consistent Sparse Quadratic Regression in Sub-quadratic Time and Space
    Yang, Shuo
    Shen, Yanyao
    Sanghavi, Sujay
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [2] Matrix-Vector Multiplication in Sub-Quadratic Time (Some Preprocessing Required)
    Williams, Ryan
    PROCEEDINGS OF THE EIGHTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2007, : 995 - 1001
  • [3] Radiosity algorithms running in sub-quadratic time
    Szirmay-Kalos, L
    Foris, T
    WSCG '97: THE FIFTH INTERNATIONAL CONFERENCE IN CENTRAL EUROPE ON COMPUTER GRAPHICS AND VISUALIZATION '97, CONFERENCE PROCEEDINGS, VOL 1-4, 1997, : 552 - 561
  • [4] Learning directed acyclic graph SPNs in sub-quadratic time
    Ghose, Amur
    Jaini, Priyank
    Poupart, Pascal
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2020, 120 : 48 - 73
  • [5] On algorithms for permuting large entries to the diagonal of a sparse matrix
    Duff, IS
    Koster, J
    SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 2001, 22 (04) : 973 - 996
  • [6] Completing gene trees without species trees in sub-quadratic time
    Mai, Uyen
    Mirarab, Siavash
    BIOINFORMATICS, 2022, 38 (06) : 1532 - 1541
  • [7] The application of sparse estimation of covariance matrix to quadratic discriminant analysis
    Sun, Jiehuan
    Zhao, Hongyu
    BMC BIOINFORMATICS, 2015, 16
  • [8] The application of sparse estimation of covariance matrix to quadratic discriminant analysis
    Jiehuan Sun
    Hongyu Zhao
    BMC Bioinformatics, 16
  • [9] Approximating Edit Distance Within Constant Factor in Truly Sub-Quadratic Time
    Chakraborty, Diptarka
    Das, Debarati
    Goldenberg, Elazar
    Koucky, Michal
    Saks, Michael
    2018 IEEE 59TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS), 2018, : 979 - 990
  • [10] Approximating Edit Distance Within Constant Factor in Truly Sub-quadratic Time
    Chakraborty, Diptarka
    Das, Debarati
    Goldenberg, Elazar
    Koucky, Michal
    Saks, Michael
    JOURNAL OF THE ACM, 2020, 67 (06)