Covariate-modulated large-scale multiple testing under dependence

Cited by: 2
Authors
Wang, Jiangzhou [1]
Cui, Tingting [2]
Zhu, Wensheng [2]
Wang, Pengfei [3]
Affiliations
[1] Shenzhen Univ, Coll Math & Stat, Shenzhen 518060, Peoples R China
[2] Northeast Normal Univ, Sch Math & Stat, Key Lab Appl Stat MOE, Changchun, Peoples R China
[3] Dongbei Univ Finance & Econ, Sch Stat, Dalian 116025, Peoples R China
Funding
National Key Research and Development Program of China; National Natural Science Foundation of China
Keywords
Covariate-modulated HMM; FDR; Local correlations; Large-scale multiple testing; False discovery rate; Hidden Markov models; Genome-wide association; Mixtures; Number
DOI
10.1016/j.csda.2022.107664
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Large-scale multiple testing, which calls for conducting tens of thousands of hypothesis tests simultaneously, has been applied in many scientific fields. Most conventional multiple testing procedures focus on controlling the false discovery rate (FDR) and largely ignore covariate information and the dependence structure among tests. An FDR control procedure, termed the Covariate-Modulated Local Index of Significance (cmLIS) procedure, is proposed; it takes into account local correlations among tests and accommodates covariate information by leveraging a covariate-modulated hidden Markov model (HMM). In the oracle case where all parameters of the covariate-modulated HMM are known, the cmLIS procedure is shown to be valid and optimal in some sense. Depending on whether the number of mixture components in the nonnull distribution is known, two Bayesian sampling algorithms are provided for parameter estimation. Extensive simulations are conducted to demonstrate the effectiveness of the cmLIS procedure relative to state-of-the-art multiple testing procedures. Finally, the cmLIS procedure is applied to an RNA sequencing data set and a schizophrenia (SCZ) data set. (c) 2022 Elsevier B.V. All rights reserved.
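As a rough illustration of the oracle setting described in the abstract, the sketch below computes LIS-type statistics (posterior null probabilities) in a two-state HMM by forward-backward recursion and then applies a step-up rule that rejects the hypotheses with the smallest statistics while their running average stays below the target level. It is not the authors' implementation. The logistic link between covariates and transition probabilities, the single-Gaussian nonnull density, and all function and parameter names (`logistic_transitions`, `lis_statistics`, `lis_step_up`, `intercepts`, `slopes`, `mu1`, `sigma1`) are assumptions made for illustration only.

```python
import numpy as np


def normal_pdf(x, mu=0.0, sigma=1.0):
    """Density of N(mu, sigma^2) evaluated at x."""
    z = (x - mu) / sigma
    return np.exp(-0.5 * z * z) / (sigma * np.sqrt(2.0 * np.pi))


def logistic_transitions(cov, intercepts, slopes):
    """Hypothetical covariate-modulated transitions (assumed logistic link).

    Returns an array of shape (m-1, 2, 2) whose entry [i, a, b] is
    P(theta_{i+1} = b | theta_i = a, cov[i+1]).
    """
    cov = np.asarray(cov, dtype=float)
    intercepts = np.asarray(intercepts, dtype=float)
    slopes = np.asarray(slopes, dtype=float)
    eta = intercepts[None, :] + slopes[None, :] * cov[1:, None]
    p1 = 1.0 / (1.0 + np.exp(-eta))          # probability of moving to state 1
    return np.stack([1.0 - p1, p1], axis=-1)


def lis_statistics(x, pi0, trans, mu1=2.0, sigma1=1.0):
    """P(theta_i = 0 | x) for every i, with all parameters treated as known.

    State 0 is null with N(0, 1) emissions; state 1 is nonnull with
    N(mu1, sigma1^2) emissions (a single-component assumption).
    """
    x = np.asarray(x, dtype=float)
    pi0 = np.asarray(pi0, dtype=float)
    m = x.size
    f = np.column_stack([normal_pdf(x), normal_pdf(x, mu1, sigma1)])
    alpha = np.zeros((m, 2))
    beta = np.ones((m, 2))
    # Forward pass, rescaled at each step to avoid numerical underflow.
    alpha[0] = pi0 * f[0]
    alpha[0] /= alpha[0].sum()
    for i in range(1, m):
        alpha[i] = (alpha[i - 1] @ trans[i - 1]) * f[i]
        alpha[i] /= alpha[i].sum()
    # Backward pass, also rescaled; the scaling cancels in the posterior.
    for i in range(m - 2, -1, -1):
        beta[i] = trans[i] @ (f[i + 1] * beta[i + 1])
        beta[i] /= beta[i].sum()
    post = alpha * beta
    post /= post.sum(axis=1, keepdims=True)
    return post[:, 0]


def lis_step_up(lis, level=0.1):
    """Reject the k smallest statistics, k being the largest count whose
    running mean of sorted statistics does not exceed `level`."""
    lis = np.asarray(lis, dtype=float)
    order = np.argsort(lis)
    running_mean = np.cumsum(lis[order]) / np.arange(1, lis.size + 1)
    passing = np.nonzero(running_mean <= level)[0]
    reject = np.zeros(lis.size, dtype=bool)
    if passing.size:
        reject[order[: passing[-1] + 1]] = True
    return reject
```

A typical call would build `trans = logistic_transitions(cov, intercepts, slopes)` from a covariate vector, pass it to `lis_statistics` together with the observed z-values, and feed the result to `lis_step_up`. In practice the parameters are unknown and would have to be estimated, which the abstract says is handled by Bayesian sampling algorithms not sketched here.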
Pages: 15
Related papers
50 in total
  • [1] Large-scale multiple testing under dependence
    Sun, Wenguang
    Cai, T. Tony
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2009, 71 : 393 - 424
  • [2] Factor Analysis for Multiple Testing (FAMT): An R Package for Large-Scale Significance Testing under Dependence
    Causeur, David
    Friguet, Chloe
    Houee-Bigot, Magalie
    Kloareg, Maela
    JOURNAL OF STATISTICAL SOFTWARE, 2011, 40 (14) : 1 - 19
  • [3] Large-scale covariate-assisted two-sample inference under dependence
    Wang, Pengfei
    Zhu, Wensheng
    SCANDINAVIAN JOURNAL OF STATISTICS, 2022, 49 (04) : 1421 - 1447
  • [4] False discovery control in large-scale spatial multiple testing
    Sun, Wenguang
    Reich, Brian J.
    Cai, T. Tony
    Guindani, Michele
    Schwartzman, Armin
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2015, 77 (01) : 59 - 83
  • [5] Large-Scale Multiple Testing of Correlations
    Cai, T. Tony
    Liu, Weidong
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2016, 111 (513) : 229 - 240
  • [6] Large-scale dependent multiple testing via higher-order hidden Markov models
    Li, Canhui
    Wang, Jiangzhou
    Wang, Pengfei
    JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2024,
  • [7] Fast and covariate-adaptive method amplifies detection power in large-scale multiple hypothesis testing
    Zhang, Martin J.
    Xia, Fei
    Zou, James
    NATURE COMMUNICATIONS, 2019, 10 (1)
  • [8] Large-scale multiple testing via multivariate hidden Markov models
    Hou, Zhiqiang
    Wang, Pengfei
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2024, 53 (04) : 1932 - 1951
  • [9] Bayesian hidden Markov models for dependent large-scale multiple testing
    Wang, Xia
    Shojaie, Ali
    Zou, Jian
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2019, 136 : 123 - 136
  • [10] Extended likelihood approach to large-scale multiple testing
    Lee, Youngjo
    Bjornstad, Jan F.
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2013, 75 (03) : 553 - 575