Heteroscedasticity-Adjusted Ranking and Thresholding for Large-Scale Multiple Testing

被引:2
|
作者
Fu, Luella [1 ]
Gang, Bowen [2 ]
James, Gareth M. [3 ]
Sun, Wenguang [3 ]
机构
[1] San Francisco State Univ, Dept Math, San Francisco, CA 94132 USA
[2] Fudan Univ, Dept Stat, Shanghai, Peoples R China
[3] Univ Southern Calif, Dept Data Sci & Operat, Los Angeles, CA 90089 USA
关键词
Covariate-assisted inference; Data processing and information loss; False discovery rate; Heteroscedasticity; Multiple testing with side information; Structured multiple testing; FALSE-DISCOVERY RATE; GENE-EXPRESSION; EMPIRICAL BAYES; POWER; HYPOTHESES; NULL; MICROARRAYS;
D O I
10.1080/01621459.2020.1840992
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Standardization has been a widely adopted practice in multiple testing, for it takes into account the variability in sampling and makes the test statistics comparable across different study units. However, despite conventional wisdom to the contrary, we show that there can be a significant loss in information from basing hypothesis tests on standardized statistics rather than the full data. We develop a new class of heteroscedasticity-adjusted ranking and thresholding (HART) rules that aim to improve existing methods by simultaneously exploiting commonalities and adjusting heterogeneities among the study units. The main idea of HART is to bypass standardization by directly incorporating both the summary statistic and its variance into the testing procedure. A key message is that the variance structure of the alternative distribution, which is subsumed under standardized statistics, is highly informative and can be exploited to achieve higher power. The proposed HART procedure is shown to be asymptotically valid and optimal for false discovery rate (FDR) control. Our simulation results demonstrate that HART achieves substantial power gain over existing methods at the same FDR level. We illustrate the implementation through a microarray analysis of myeloma.
引用
收藏
页码:1028 / 1040
页数:13
相关论文
共 50 条
  • [41] UGM: a more stable procedure for large-scale multiple testing problems, new solutions to identify oncogene
    Liu, Chengyou
    Zhou, Leilei
    Wang, Yuhe
    Tian, Shuchang
    Zhu, Junlin
    Qin, Hang
    Ding, Yong
    Jiang, Hongbing
    THEORETICAL BIOLOGY AND MEDICAL MODELLING, 2019, 16 (01)
  • [42] Making the cut: improved ranking and selection for large-scale inference
    Henderson, Nicholas C.
    Newton, Michael A.
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2016, 78 (04) : 781 - 804
  • [43] Large-Scale Simultaneous Testing Using Kernel Density Estimation
    Santu Ghosh
    Alan M. Polansky
    Sankhya A, 2022, 84 (2): : 808 - 843
  • [44] Large-Scale Simultaneous Testing Using Kernel Density Estimation
    Ghosh, Santu
    Polansky, Alan M.
    SANKHYA-SERIES A-MATHEMATICAL STATISTICS AND PROBABILITY, 2022, 84 (02): : 808 - 843
  • [45] Optimal Control of Directional False Discovery Rates in Large-Scale Testing
    Tang, Guozhu
    Kang, Yicheng
    Xiang, Dongdong
    STATISTICS IN MEDICINE, 2025, 44 (05)
  • [46] MixTwice: large-scale hypothesis testing for peptide arrays by variance mixing
    Zheng, Zihao
    Mergaert, Aisha M.
    Ong, Irene M.
    Shelef, Miriam A.
    Newton, Michael A.
    BIOINFORMATICS, 2021, 37 (17) : 2637 - 2643
  • [47] Adaptive choice of the number of bootstrap samples in large scale multiple testing
    Guo, Wenge
    Peddada, Shyamal
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2008, 7 (01)
  • [48] Large-scale multiple testing in genome-wide association studies via region-specific hidden Markov models
    Jian Xiao
    Wensheng Zhu
    Jianhua Guo
    BMC Bioinformatics, 14
  • [49] Large-scale multiple testing in genome-wide association studies via region-specific hidden Markov models
    Xiao, Jian
    Zhu, Wensheng
    Guo, Jianhua
    BMC BIOINFORMATICS, 2013, 14
  • [50] LARGE-SCALE SIMULTANEOUS TESTING OF CROSS-COVARIANCE MATRICES WITH APPLICATIONS TO PheWAS
    Cai, Tianxi
    Cai, T. Tony
    Liao, Katherine
    Liu, Weidong
    STATISTICA SINICA, 2019, 29 (02) : 983 - 1005