P-value calibration for multiple testing problems in genomics

被引:3
作者
Ferguson, John P. [1 ]
Palejev, Dean [2 ]
机构
[1] Univ Limerick, Dept Nephrol, Grad Entry Med Sch, Limerick, Ireland
[2] Bulgarian Acad Sci, Inst Math & Informat, BU-1113 Sofia, Bulgaria
关键词
calibration; conditional likelihood; conservative test; FDR correction; p-value; RNA-SEQ; WIDE ASSOCIATION; CHIP-SEQ; STATISTICAL-METHODS;
D O I
10.1515/sagmb-2013-0074
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Conservative statistical tests are often used in complex multiple testing settings in which computing the type I error may be difficult. In such tests, the reported p-value for a hypothesis can understate the evidence against the null hypothesis and consequently statistical power may be lost. False Discovery Rate adjustments, used in multiple comparison settings, can worsen the unfavorable effect. We present a computationally efficient and test-agnostic calibration technique that can substantially reduce the conservativeness of such tests. As a consequence, a lower sample size might be sufficient to reject the null hypothesis for true alternatives, and experimental costs can be lowered. We apply the calibration technique to the results of DESeq, a popular method for detecting differentially expressed genes from RNA sequencing data. The increase in power may be particularly high in small sample size experiments, often used in preliminary experiments and funding applications.
引用
收藏
页码:659 / 673
页数:15
相关论文
共 32 条
[1]  
ANDERS S., 2013, Differential Expression of RNA-Seq Data At the Gene Level - the DESeq Package
[2]   Differential expression analysis for sequence count data [J].
Anders, Simon ;
Huber, Wolfgang .
GENOME BIOLOGY, 2010, 11 (10)
[3]  
Atanassov E., MATH IND IN PRESS
[4]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[5]   Integration of 198 ChIP-seq Datasets Reveals Human cis-Regulatory Regions [J].
Bolouri, Hamid ;
Ruzzo, Walter L. .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2012, 19 (09) :989-997
[6]   Evaluating Gene Expression in C57BL/6J and DBA/2J Mouse Striatum Using RNA-Seq and Microarrays [J].
Bottomly, Daniel ;
Walter, Nicole A. R. ;
Hunter, Jessica Ezzell ;
Darakjian, Priscila ;
Kawane, Sunita ;
Buck, Kari J. ;
Searles, Robert P. ;
Mooney, Michael ;
McWeeney, Shannon K. ;
Hitzemann, Robert .
PLOS ONE, 2011, 6 (03)
[7]   Conservation of an RNA regulatory map between Drosophila and mammals [J].
Brooks, Angela N. ;
Yang, Li ;
Duff, Michael O. ;
Hansen, Kasper D. ;
Park, Jung W. ;
Dudoit, Sandrine ;
Brenner, Steven E. ;
Graveley, Brenton R. .
GENOME RESEARCH, 2011, 21 (02) :193-202
[8]   GENETIC SCREENS RNA-seq into the toolkit [J].
Burgess, Darren J. .
NATURE REVIEWS GENETICS, 2013, 14 (03) :154-155
[9]   Prioritizing GWAS Results: A Review of Statistical Methods and Recommendations for Their Application [J].
Cantor, Rita M. ;
Lange, Kenneth ;
Sinsheimer, Janet S. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2010, 86 (01) :6-22
[10]  
Casella G., 2002, Statistical inference, V2nd edition