Null-free False Discovery Rate Control Using Decoy Permutations

被引:0
作者
Kun He
Meng-jie Li
Yan Fu
Fu-zhou Gong
Xiao-ming Sun
机构
[1] Chinese Academy of Sciences,Iinstitute of Computing Technology
[2] Chinese Academy of Sciences,CEMS, NCMIS, RCSDS, Academy of Mathematics and Systems Science
[3] University of Chinese Academy of Sciences,undefined
来源
Acta Mathematicae Applicatae Sinica, English Series | 2022年 / 38卷
关键词
multiple testing; false discovery rate; null distribution-free; -value-free; decoy permutations; knockoff filter; 62G10; 62H15;
D O I
暂无
中图分类号
学科分类号
摘要
The traditional approaches to false discovery rate (FDR) control in multiple hypothesis testing are usually based on the null distribution of a test statistic. However, all types of null distributions, including the theoretical, permutation-based and empirical ones, have some inherent drawbacks. For example, the theoretical null might fail because of improper assumptions on the sample distribution. Here, we propose a null distribution-free approach to FDR control for multiple hypothesis testing in the case-control study. This approach, named target-decoy procedure, simply builds on the ordering of tests by some statistic or score, the null distribution of which is not required to be known. Competitive decoy tests are constructed from permutations of original samples and are used to estimate the false target discoveries. We prove that this approach controls the FDR when the score function is symmetric and the scores are independent between different tests. Simulation demonstrates that it is more stable and powerful than two popular traditional approaches, even in the existence of dependency. Evaluation is also made on two real datasets, including an arabidopsis genomics dataset and a COVID-19 proteomics dataset.
引用
收藏
页码:235 / 253
页数:18
相关论文
共 118 条
[1]  
Almudevar A(2006)Utility of correlation measures in analysis of gene expression NeuroRx 3 384-395
[2]  
Klebanov LB(2015)Controlling the false discovery rate via knockoffs The Annals of Statistics 43 2055-2085
[3]  
Qiu X(2019)A knockoff filter for high-dimensional selective inference The Annals of Statistics 47 2504-2537
[4]  
Salzman P(2020)Robust inference with knockoffs The Annals of Statistics 48 1409-1431
[5]  
Yakovlev AY(2018)Weighted false discovery rate control in large-scale multiple testing Journal of the American Statistical Association 113 1172-1183
[6]  
Barber RF(1995)Controlling the false discovery rate: a practical and powerful approach to multiple testing Journal of the Royal statistical society: series B (Methodological) 57 289-300
[7]  
Candès E J(2006)Adaptive linear step-up procedures that control the false discovery rate Biometrika 93 491-507
[8]  
Barber RF(2001)The control of the false discovery rate in multiple testing under dependency Annals of statistics 29 1165-1188
[9]  
Candès EJ(2018)Panning for gold: model-x knockoffs for high dimensional controlled variable selection Journal of the Royal Statistical Society: Series B (Statistical Methodology) 80 551-577
[10]  
Barber RF(2020)Beyond target-decoy competition: Stable validation of peptide and protein identifications in mass spectrometry-based discovery proteomics Analytical Chemistry 92 14898-14906