Minimax Quasi-Bayesian Estimation in Sparse Canonical Correlation Analysis via a Rayleigh Quotient Function

被引:0
作者
Zhu, Qiuyun [1 ,2 ]
Atchade, Yves [1 ]
机构
[1] Boston Univ, Dept Math & Stat, Boston, MA USA
[2] Univ Minnesota, Sch Stat, St Paul, MN 55455 USA
关键词
Covid-19; Markov chain Monte Carlo; Minimax estimation; Quasi-Bayesian inference; Simulated annealing; Simulated tempering; Sparse CCA; CHAIN MONTE-CARLO; MODEL;
D O I
10.1080/01621459.2023.2271199
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Canonical correlation analysis (CCA) is a popular statistical technique for exploring relationships between datasets. In recent years, the estimation of sparse canonical vectors has emerged as an important but challenging variant of the CCA problem, with widespread applications. Unfortunately, existing rate-optimal estimators for sparse canonical vectors have high computational cost. We propose a quasi-Bayesian estimation procedure that not only achieves the minimax estimation rate, but also is easy to compute by Markov chain Monte Carlo (MCMC). The method builds on (Tan et al.) and uses a rescaled Rayleigh quotient function as the quasi-log-likelihood. However, unlike (Tan et al.), we adopt a Bayesian framework that combines this quasi-log-likelihood with a spike-and-slab prior to regularize the inference and promote sparsity. We investigate the empirical behavior of the proposed method on both continuous and truncated data, and we demonstrate that it outperforms several state-of-the-art methods. As an application, we use the proposed methodology to maximally correlate clinical variables and proteomic data for better understanding the Covid-19 disease. Supplementary materials for this article are available online.
引用
收藏
页码:2647 / 2657
页数:11
相关论文
共 39 条
  • [1] Andrew G., 2013, Proceedings of the 30th International Conference on Machine Learning, P1247
  • [2] [Anonymous], 2008, MONTE CARLO STRATEGI
  • [3] SIMULATED ANNEALING
    BERTSIMAS, D
    TSITSIKLIS, J
    [J]. STATISTICAL SCIENCE, 1993, 8 (01) : 10 - 15
  • [4] Bhattacharyya A, 2019, Arxiv, DOI arXiv:1907.01170
  • [5] A general framework for updating belief distributions
    Bissiri, P. G.
    Holmes, C. C.
    Walker, S. G.
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2016, 78 (05) : 1103 - 1130
  • [6] CARLIN BP, 1995, J ROY STAT SOC B MET, V57, P473
  • [7] Catoni O., 2001, Lecture notes from the 31st Summer School on Probability Theory held in Saint-Flour
  • [8] An MCMC approach to classical estimation
    Chernozhukov, V
    Hong, H
    [J]. JOURNAL OF ECONOMETRICS, 2003, 115 (02) : 293 - 346
  • [9] High dimensional semiparametric latent graphical model for mixed data
    Fan, Jianqing
    Liu, Han
    Ning, Yang
    Zou, Hui
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2017, 79 (02) : 405 - 421
  • [10] CONTROL OF MALARIA VIRULENCE BY ALPHA-1-ACID GLYCOPROTEIN (OROSOMUCOID), AN ACUTE-PHASE (INFLAMMATORY) REACTANT
    FRIEDMAN, MJ
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA-BIOLOGICAL SCIENCES, 1983, 80 (17): : 5421 - 5424