An RIHT statistic for testing the equality of several high-dimensional mean vectors under homoskedasticity

被引:1
作者
Zhang, Qiuyan [1 ]
Wang, Chen [2 ]
Zhang, Baoxue [1 ]
Yang, Hu [3 ]
机构
[1] Capital Univ Econ & Business, Sch Stat, Beijing, Peoples R China
[2] Univ Hong Kong, Dept Stat & Actuarial Sci, Hong Kong, Peoples R China
[3] Cent Univ Finance & Econ, Sch Informat, Beijing, Peoples R China
关键词
Exact four -moment theorem; Central limit theorem; Mean vector test; High -dimensional data analysis; HOTELLINGS T-2 TEST; COVARIANCE MATRICES; 2-SAMPLE TEST; CANCER;
D O I
10.1016/j.csda.2023.107855
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this article, the problem of testing the equality of several mean vectors is considered under the homoskedasticity in a high-dimensional setting. A ridgelized Hotelling's T-2 test (RIHT) is developed and the asymptotic distributions are derived. By requiring only the conditions on the first four moments of the underlying distribution, the RIHT test can be used to test the mean vector free of population distributions under both p >= n and p < n and improve the power of the classic Hotelling's T-2 test. The innovations of the proposed statistic include the following: (1) the RIHT statistic is derived in accordance with a penalized likelihood ratio test; (2) the exact four-moment theorem of the RIHT test makes it possible to test data with an arbitrary distribution; and (3) the proposed statistic is less sensitive to highly correlated data from simulations due to the penalty imposed on concentration matrix. Simulations and real data applications show that the RIHT test performs well and is more powerful than alternatives.(c) 2023 Elsevier B.V. All rights reserved.
引用
收藏
页数:28
相关论文
共 31 条
  • [1] Bai Z, 2010, SPRINGER SER STAT, P1, DOI 10.1007/978-1-4419-0661-8
  • [2] Bai ZD, 2004, ANN PROBAB, V32, P553
  • [3] Bai ZD, 1996, STAT SINICA, V6, P311
  • [4] Bai ZD, 2008, STAT SINICA, V18, P425
  • [5] A Bayesian framework for the analysis of microarray expression data: regularized t-test and statistical inferences of gene changes
    Baldi, P
    Long, AD
    [J]. BIOINFORMATICS, 2001, 17 (06) : 509 - 519
  • [6] Two-sample test of high dimensional means under dependence
    Cai, T. Tony
    Liu, Weidong
    Xia, Yin
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2014, 76 (02) : 349 - 372
  • [7] A Regularized Hotelling's T2 Test for Pathway Analysis in Proteomic Studies
    Chen, Lin S.
    Paul, Debashis
    Prentice, Ross L.
    Wang, Pei
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (496) : 1345 - 1360
  • [8] A TWO-SAMPLE TEST FOR HIGH-DIMENSIONAL DATA WITH APPLICATIONS TO GENE-SET TESTING
    Chen, Song Xi
    Qin, Ying-Li
    [J]. ANNALS OF STATISTICS, 2010, 38 (02) : 808 - 835
  • [9] An Independent Filter for Gene Set Testing Based on Spectral Enrichment
    Frost, H. Robert
    Li, Zhigang
    Asselbergs, Folkert W.
    Moore, Jason H.
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2015, 12 (05) : 1076 - 1086
  • [10] Fujikoshi Y., 2004, J JAPAN STAT SOC, V34, P19