Large-scale dependent multiple testing via higher-order hidden Markov models

被引:3
作者
Li, Canhui [1 ]
Wang, Jiangzhou [2 ]
Wang, Pengfei [3 ]
机构
[1] Henan Univ, Sch Math & Stat, Kaifeng, Peoples R China
[2] Shenzhen Univ, Inst Stat Sci, Coll Math & Stat, Shenzhen, Peoples R China
[3] Dongbei Univ Finance & Econ, Sch Stat, 217 Jianshan St, Dalian 116025, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
FDR; local correlations; multiple testing; the higher-order HMM; FALSE DISCOVERY RATE; GENOME-WIDE ASSOCIATION;
D O I
10.1080/10543406.2024.2420657
中图分类号
R9 [药学];
学科分类号
1007 ;
摘要
Taking into account the local dependence structure in large-scale multiple testing is expected to improve both the efficiency of the testing procedure and the interpretability of scientific findings. The hidden Markov model (HMM), as an effective model to describe the sequential dependence, has been successfully applied to large-scale multiple testing with local correlations. However, in many applications, the first-order Markov chain is not flexible enough to capture the complexity of local correlations. To address this issue, this paper proposes a novel multiple testing procedure that uses a higher-order Markov chain to better characterize local correlations among tests. The proposed procedure is validated by theoretical results and simulation studies, which show that it outperforms its competitors in terms of power. Finally, a real data analysis is presented to demonstrate the favorable performance of the proposed procedure.
引用
收藏
页数:13
相关论文
共 29 条
[1]  
Abraham K, 2022, J MACH LEARN RES, V23
[2]   On the adaptive control of the false discovery fate in multiple testing with independent statistics [J].
Benjamini, Y ;
Hochberg, Y .
JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICS, 2000, 25 (01) :60-83
[3]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[4]   Covariate-assisted ranking and screening for large-scale two-sample inference [J].
Cai, T. Tony ;
Sun, Wenguang ;
Wang, Weinan .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2019, 81 (02) :187-234
[5]   Covariate-adjusted multiple testing in genome-wide association studies via factorial hidden Markov models [J].
Cui, Tingting ;
Wang, Pengfei ;
Zhu, Wensheng .
TEST, 2021, 30 (03) :737-757
[6]   Two-group Poisson-Dirichlet mixtures for multiple testing [J].
Denti, Francesco ;
Guindani, Michele ;
Leisen, Fabrizio ;
Lijoi, Antonio ;
Wadsworth, William Duncan ;
Vannucci, Marina .
BIOMETRICS, 2021, 77 (02) :622-633
[7]   Empirical Bayes analysis of a microarray experiment [J].
Efron, B ;
Tibshirani, R ;
Storey, JD ;
Tusher, V .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1151-1160
[8]   Correlation and large-scale simultaneous significance testing [J].
Efron, Bradley .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2007, 102 (477) :93-103
[9]   A stochastic process approach to false discovery control [J].
Genovese, C ;
Wasserman, L .
ANNALS OF STATISTICS, 2004, 32 (03) :1035-1061
[10]   Operating characteristics and extensions of the false discovery rate procedure [J].
Genovese, C ;
Wasserman, L .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2002, 64 :499-517