Controlling the false-discovery rate in astrophysical data analysis

被引:113
作者
Miller, CJ
Genovese, C
Nichol, RC
Wasserman, L
Connolly, A
Reichart, D
Hopkins, A
Schneider, J
机构
[1] Carnegie Mellon Univ, Dept Phys, Pittsburgh, PA 15213 USA
[2] Carnegie Mellon Univ, Dept Stat, Pittsburgh, PA 15213 USA
[3] Univ Pittsburgh, Dept Phys & Astron, Pittsburgh, PA 15260 USA
[4] CALTECH, Dept Astron, Pasadena, CA 91125 USA
[5] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
关键词
methods : analytical; methods : data analysis; methods : statistical; techniques : image processing;
D O I
10.1086/324109
中图分类号
P1 [天文学];
学科分类号
0704 ;
摘要
The false-discovery rate (FDR) is a new statistical procedure to control the number of mistakes made when performing multiple hypothesis tests, i.e., when comparing many data against a given model hypothesis. The key advantage of FDR is that it allows one to a priori control the average fraction of false rejections made (when comparing with the null hypothesis) over the total number of rejections performed. We compare FDR with the standard procedure of rejecting all tests that do not match the null hypothesis above some arbitrarily chosen confidence limit, e.g., 2 sigma, or at the 95% confidence level. We find a similar rate of correct detections, but with significantly fewer false detections. Moreover, the FDR procedure is quick and easy to compute and can be trivially adapted to work with correlated data. The purpose of this paper is to introduce the FDR procedure to the astrophysics community. We illustrate the power of FDR through several astronomical examples, including the detection of features against a smooth one-dimensional function, e.g., seeing the "baryon wiggles" in a power spectrum of matter fluctuations, and source pixel detection in imaging data. In this era of large data sets and high-precision measurements, FDR provides the means to adaptively control a scientifically meaningful quantity-the fraction of false discoveries over total discoveries.
引用
收藏
页码:3492 / 3505
页数:14
相关论文
共 11 条
[1]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[2]  
BENJAMINI Y, 2001, IN PRESS ANN STAT
[3]  
Casella G., 2021, STAT INFERENCE
[4]  
GENOVESE C, 2001, UNPUB J R STAT SOC B
[5]  
HOPKINS AM, 2002, IN PRESS AJ
[6]   A high spatial resolution analysis of the MAXIMA-1 cosmic microwave background anisotropy data [J].
Lee, AT ;
Ade, P ;
Balbi, A ;
Bock, J ;
Borrill, J ;
Boscaleri, A ;
de Bernardis, P ;
Ferreira, PG ;
Hanany, S ;
Hristov, VV ;
Jaffe, AH ;
Mauskopf, PD ;
Netterfield, CB ;
Pascale, E ;
Rabii, B ;
Richards, PL ;
Smoot, GF ;
Stompor, R ;
Winant, CD ;
Wu, JHP .
ASTROPHYSICAL JOURNAL, 2001, 561 (01) :L1-L5
[7]   Possible detection of baryonic fluctuations in the large-scale structure power spectrum [J].
Miller, CJ ;
Nichol, RC ;
Batuski, DJ .
ASTROPHYSICAL JOURNAL, 2001, 555 (01) :68-73
[8]  
NETTERFIELD CB, 2001, UNPUB APJ
[9]   A line-of-sight integration approach to cosmic microwave background anisotropies [J].
Seljak, U ;
Zaldarriaga, M .
ASTROPHYSICAL JOURNAL, 1996, 469 (02) :437-444
[10]   Simultaneous multicolor detection of faint galaxies in the Hubble Deep Field [J].
Szalay, AS ;
Connolly, AJ ;
Szokoly, GP .
ASTRONOMICAL JOURNAL, 1999, 117 (01) :68-74