Bayesian Optimal Two-Sample Tests for High-Dimensional Gaussian Populations

被引:2
作者
Lee, Kyoungjae [1 ]
You, Kisung [2 ]
Lin, Lizhen [3 ]
机构
[1] Sungkyunkwan Univ, Dept Stat, Seoul 03063, South Korea
[2] Yale Univ, Sch Med, Dept Internal Med, New Haven, CT 06510 USA
[3] Univ Notre Dame, Dept Appl & Computat Math & Stat, Notre Dame, IN 46556 USA
来源
BAYESIAN ANALYSIS | 2024年 / 19卷 / 03期
基金
新加坡国家研究基金会;
关键词
Bayesian hypothesis test; Bayes factor consistency; high-dimensional covariance matrix; optimal high-dimensional tests; COVARIANCE MATRICES; FEWER OBSERVATIONS; EQUALITY;
D O I
10.1214/23-BA1373
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
We propose minimax optimal Bayesian two-sample tests for testing equality of high-dimensional mean vectors and covariance matrices between two populations. In many applications including genomics and medical imaging, it is natural to assume that only a few entries of two mean vectors or covariance matrices are different. Many existing tests that rely on aggregating the difference between empirical means or covariance matrices are not optimal or yield low power under such setups. Motivated by this, we develop Bayesian two-sample tests employing a divide-and-conquer idea, which is powerful especially when the differences between two populations are rare but large. The proposed two-sample tests manifest closed forms of Bayes factors and allow scalable computations even in high-dimensions. We prove that the proposed tests are consistent under relatively mild conditions compared to existing tests in the literature. Furthermore, the testable regions from the proposed tests turn out to be minimax optimal in terms of rates. Simulation studies show clear advantages of the proposed tests over other state-of-the-art methods in various scenarios. Our tests are also applied to the analysis of the gene expression data of two cancer data sets.
引用
收藏
页码:869 / 893
页数:25
相关论文
共 23 条
[1]  
Bai ZD, 1996, STAT SINICA, V6, P311
[2]   Two-sample test of high dimensional means under dependence [J].
Cai, T. Tony ;
Liu, Weidong ;
Xia, Yin .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2014, 76 (02) :349-372
[3]   Two-Sample Covariance Matrix Testing and Support Recovery in High-Dimensional and Sparse Settings [J].
Cai, Tony ;
Liu, Weidong ;
Xia, Yin .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2013, 108 (501) :265-277
[4]   A Constrained l1 Minimization Approach to Sparse Precision Matrix Estimation [J].
Cai, Tony ;
Liu, Weidong ;
Luo, Xi .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (494) :594-607
[5]   Adaptive Thresholding for Sparse Covariance Matrix Estimation [J].
Cai, Tony ;
Liu, Weidong .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (494) :672-684
[6]   Two-sample tests of high-dimensional means for compositional data [J].
Cao, Yuanpei ;
Lin, Wei ;
Li, Hongzhe .
BIOMETRIKA, 2018, 105 (01) :115-132
[7]   A Two-Sample Test for Equality of Means in High Dimension [J].
Gregory, Karl Bruce ;
Carroll, Raymond J. ;
Baladandayuthapani, Veerabhadran ;
Lahiri, Soumendra N. .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2015, 110 (510) :837-849
[8]   HIGH-DIMENSIONAL TWO-SAMPLE COVARIANCE MATRIX TESTING VIA SUPER-DIAGONALS [J].
He, Jing ;
Chen, Song Xi .
STATISTICA SINICA, 2018, 28 (04) :2671-2696
[9]  
Jeffreys H., 1998, THEORY PROBABILITY
[10]   BAYES FACTORS [J].
KASS, RE ;
RAFTERY, AE .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1995, 90 (430) :773-795