An efficient parallel block coordinate descent algorithm for large-scale precision matrix estimation using graphics processing units

被引:2
|
作者
Choi, Young-Geun [1 ]
Lee, Seunghwan [2 ]
Yu, Donghyeon [2 ]
机构
[1] Sookmyung Womens Univ, Dept Stat, Seoul, South Korea
[2] Inha Univ, Dept Stat, Incheon, South Korea
基金
新加坡国家研究基金会;
关键词
CONCORD; Edge coloring; Parallel coordinate descent; Graphical model; GPU-parallel computation; INVERSE COVARIANCE ESTIMATION; SPARSE; CONVERGENCE; LASSO; REGRESSION; SELECTION; INSIGHTS;
D O I
10.1007/s00180-021-01127-x
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Large-scale sparse precision matrix estimation has attracted wide interest from the statistics community. The convex partial correlation selection method (CONCORD) developed by Khare et al. (J R Stat Soc Ser B (Stat Methodol) 77(4):803-825, 2015) has recently been credited with some theoretical properties for estimating sparse precision matrices. The CONCORD obtains its solution by a coordinate descent algorithm (CONCORD-CD) based on the convexity of the objective function. However, since a coordinate-wise update in CONCORD-CD is inherently serial, a scale-up is nontrivial. In this paper, we propose a novel parallelization of CONCORD-CD, namely, CONCORD-PCD. CONCORD-PCD partitions the off-diagonal elements into several groups and updates each group simultaneously without harming the computational convergence of CONCORD-CD. We guarantee this by employing the notion of edge coloring in graph theory. Specifically, we establish a nontrivial correspondence between scheduling the updates of the off-diagonal elements in CONCORD-CD and coloring the edges of a complete graph. It turns out that CONCORD-PCD simultanoeusly updates off-diagonal elements in which the associated edges are colorable with the same color. As a result, the number of steps required for updating off-diagonal elements reduces from p(p - 1)/2 to p - 1 (for even p) or p (for odd p), where p denotes the number of variables. We prove that the number of such steps is irreducible In addition, CONCORD-PCD is tailored to single-instruction multiple-data (SIMD) parallelism. A numerical study shows that the SIMD-parallelized PCD algorithm implemented in graphics processing units boosts the CONCORD-CD algorithm multiple times. The method is available in the R package pcdconcord.
引用
收藏
页码:419 / 443
页数:25
相关论文
共 50 条
  • [1] An efficient parallel block coordinate descent algorithm for large-scale precision matrix estimation using graphics processing units
    Young-Geun Choi
    Seunghwan Lee
    Donghyeon Yu
    Computational Statistics, 2022, 37 : 419 - 443
  • [2] Stochastic Parallel Block Coordinate Descent for Large-Scale Saddle Point Problems
    Zhu, Zhanxing
    Storkey, Amos J.
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 2429 - 2435
  • [3] An efficient GPU-parallel coordinate descent algorithm for sparse precision matrix estimation via scaled lasso
    Lee, Seunghwan
    Kim, Sang Cheol
    Yu, Donghyeon
    COMPUTATIONAL STATISTICS, 2023, 38 (01) : 217 - 242
  • [4] An efficient GPU-parallel coordinate descent algorithm for sparse precision matrix estimation via scaled lasso
    Seunghwan Lee
    Sang Cheol Kim
    Donghyeon Yu
    Computational Statistics, 2023, 38 : 217 - 242
  • [5] On the flexibility of block coordinate descent for large-scale optimization
    Wang, Xiangfeng
    Zhang, Wenjie
    Yan, Junchi
    Yuan, Xiaoming
    Zha, Hongyuan
    NEUROCOMPUTING, 2018, 272 : 471 - 480
  • [6] A Block-Coordinate Descent Approach for Large-scale Sparse Inverse Covariance Estimation
    Treister, Eran
    Turek, Javier
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [7] Block-enhanced precision matrix estimation for large-scale datasets
    Eftekhari, Aryan
    Pasadakis, Dimosthenis
    Bollhoefer, Matthias
    Scheidegger, Simon
    Schenk, Olaf
    JOURNAL OF COMPUTATIONAL SCIENCE, 2021, 53 (53)
  • [8] Large-scale ferrofluid simulations on graphics processing units
    Polyakov, A. Yu.
    Lyutyy, T. V.
    Denisov, S.
    Reva, V. V.
    Haenggi, P.
    COMPUTER PHYSICS COMMUNICATIONS, 2013, 184 (06) : 1483 - 1489
  • [9] Efficient Asynchronous Semi-Stochastic Block Coordinate Descent Methods for Large-Scale SVD
    Shang, Fanhua
    Zhang, Zhihui
    Liu, Yuanyuan
    Liu, Hongying
    Xu, Jing
    IEEE ACCESS, 2021, 9 : 126159 - 126171
  • [10] DOA Estimation Using a Greedy Block Coordinate Descent Algorithm
    Wei, Xiaohan
    Yuan, Yabo
    Ling, Qing
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2012, 60 (12) : 6382 - 6394