Covariate-modulated large-scale multiple testing under dependence

Cited by: 2

Authors:

Wang, Jiangzhou [1]

Cui, Tingting [2]

Zhu, Wensheng [2]

Wang, Pengfei [3]

Affiliations:

[1] Shenzhen Univ, Coll Math & Stat, Shenzhen 518060, Peoples R China

[2] Northeast Normal Univ, Sch Math & Stat, Key Lab Appl Stat MOE, Changchun, Peoples R China

[3] Dongbei Univ Finance & Econ, Sch Stat, Dalian 116025, Peoples R China

Source:

COMPUTATIONAL STATISTICS & DATA ANALYSIS | 2023 / Vol. 180

Funding:

National Key Research and Development Program of China; National Natural Science Foundation of China

Keywords:

Covariate-modulated HMM; FDR; Local correlations; Large-scale multiple testing; FALSE DISCOVERY RATE; HIDDEN MARKOV-MODELS; GENOME-WIDE ASSOCIATION; MIXTURES; NUMBER;

DOI:

10.1016/j.csda.2022.107664

Chinese Library Classification (CLC):

TP39 [Computer applications]

Discipline classification codes:

081203; 0835

Abstract:

Large-scale multiple testing, which calls for conducting tens of thousands of hypothesis tests simultaneously, has been applied in many scientific fields. Most conventional multiple testing procedures focus on controlling the false discovery rate (FDR) and largely ignore covariate information and the dependence structure among tests. An FDR control procedure, termed the Covariate-Modulated Local Index of Significance (cmLIS) procedure, is proposed; it accounts for local correlations among tests and also accommodates covariate information by leveraging a covariate-modulated hidden Markov model (HMM). In the oracle case where all parameters of the covariate-modulated HMM are known, the cmLIS procedure is shown to be valid and optimal in some sense. Two Bayesian sampling algorithms are provided for parameter estimation, one for the case where the number of mixture components in the nonnull distribution is known and one for the case where it is unknown. Extensive simulations are conducted to demonstrate the effectiveness of the cmLIS procedure over state-of-the-art multiple testing procedures. Finally, the cmLIS procedure is applied to an RNA sequencing data set and a schizophrenia (SCZ) data set. (c) 2022 Elsevier B.V. All rights reserved.
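The abstract does not spell out the decision rule, so the following is only a minimal sketch of the generic step-up thresholding commonly paired with local-index-of-significance statistics, not the authors' implementation. It assumes the per-test values (posterior probabilities of the null given all data, which cmLIS would obtain from the covariate-modulated HMM) have already been computed; the function name `lis_step_up`, its arguments, and the example values are hypothetical.

```python
import numpy as np


def lis_step_up(lis, alpha=0.1):
    """Return a boolean rejection mask from LIS-type statistics.

    Assumption: `lis[i]` is the posterior probability that hypothesis i
    is null. Hypotheses are ranked by increasing value, and the largest
    set whose average value stays at or below `alpha` is rejected.
    """
    lis = np.asarray(lis, dtype=float)
    order = np.argsort(lis, kind="stable")  # most significant first
    # Running mean of the k smallest values, for k = 1..m.
    running_mean = np.cumsum(lis[order]) / np.arange(1, lis.size + 1)
    passing = np.nonzero(running_mean <= alpha)[0]
    reject = np.zeros(lis.size, dtype=bool)
    if passing.size:
        k = passing[-1] + 1  # number of hypotheses to reject
        reject[order[:k]] = True
    return reject


# Hypothetical values for five tests.
example = [0.01, 0.9, 0.05, 0.3, 0.02]
mask = lis_step_up(example, alpha=0.1)
```

Because the values are sorted in increasing order, the running mean is nondecreasing, so taking the last index that passes the threshold gives the largest admissible rejection set.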

Pages: 15

50 items in total

[21] LARGE-SCALE MULTIPLE INFERENCE OF COLLECTIVE DEPENDENCE WITH APPLICATIONS TO PROTEIN FUNCTION
Jernigan, Robert
Jia, Kejue
Ren, Zhao
Zhou, Wen
ANNALS OF APPLIED STATISTICS, 2021, 15 (02) : 902 - 924
[22] Large-scale dependent multiple testing via hidden semi-Markov models
Wang, Jiangzhou
Wang, Pengfei
COMPUTATIONAL STATISTICS, 2024, 39 (03) : 1093 - 1126
[23] Global Testing and Large-Scale Multiple Testing for High-Dimensional Covariance Structures
Cai, T. Tony
ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, VOL 4, 2017, 4 : 423 - 446
[24] Post hoc power estimation in large-scale multiple testing problems
Zehetmayer, Sonja
Posch, Martin
BIOINFORMATICS, 2010, 26 (08) : 1050 - 1056
[25] Heteroscedasticity-Adjusted Ranking and Thresholding for Large-Scale Multiple Testing
Fu, Luella
Gang, Bowen
James, Gareth M.
Sun, Wenguang
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2022, 117 (538) : 1028 - 1040
[26] False discovery rates for large-scale model checking under certain dependence
Deng, Lu
Zi, Xuemin
Li, Zhonghua
COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2018, 47 (01) : 64 - 79
[27] A Factor Model Approach to Multiple Testing Under Dependence
Friguet, Chloe
Kloareg, Maela
Causeur, David
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2009, 104 (488) : 1406 - 1415
[28] Assessing mean and median filters in multiple testing for large-scale imaging data
Chunming Zhang
TEST, 2014, 23 : 51 - 71
[29] Assessing mean and median filters in multiple testing for large-scale imaging data
Zhang, Chunming
TEST, 2014, 23 (01) : 51 - 71
[30] Multiple testing under negative dependence
Chi, Ziyu
Ramdas, Aaditya
Wang, Ruodu
BERNOULLI, 2025, 31 (02) : 1230 - 1255
