The false discovery rate for statistical pattern recognition

被引：5

作者：

Scott, Clayton ^{[1
]}

Bellala, Gowtham ^{[1
]}

Willett, Rebecca ^{[2
]}

机构：

[1] Univ Michigan, Ann Arbor, MI 48109 USA

[2] Duke Univ, Durham, NC 27706 USA

来源：

ELECTRONIC JOURNAL OF STATISTICS | 2009年 / 3卷

基金：

美国国家科学基金会;

关键词：

Statistical learning theory; generalization error; false discovery rate; BOUNDS;

D O I：

10.1214/09-EJS363

中图分类号：

O21 [概率论与数理统计]; C8 [统计学];

学科分类号：

020208 ; 070103 ; 0714 ;

摘要：

The false discovery rate (FDR) and false nondiscovery rate (FNDR) have received considerable attention in the literature on multiple testing. These performance measures are also appropriate for classification, and in this work we develop generalization error analyses for FDR and FNDR when learning a classifier from labeled training data. Unlike more conventional classification performance measures, the empirical FDR and FNDR are not binomial random variables but rather a ratio of binomials, which introduces challenges not present in conventional formulations of the classification problem. We develop distribution-free uniform deviation bounds and apply these to obtain finite sample bounds and strong universal consistency. We also present a simulation study demonstrating the merits of variance-based bounds, which we also develop. In the context of multiple testing with FDR/FNDR, our frame work may be viewed as a way to leverage training data to achieve distribution free, asymptotically optimal inference under the random effects model.

引用

页码：651 / 677

页数：27

共 38 条

[1] Agarwal S, 2005, J MACH LEARN RES, V6, P393
[2] DIAGNOSTIC-TESTS-2 - PREDICTIVE VALUES .4.
ALTMAN, DG
BLAND, JM
[J]. BRITISH MEDICAL JOURNAL, 1994, 309 (6947) : 102 - 102
[3] [Anonymous], 1946, PROBABILITY THEORY
[4] [Anonymous], 1991, Probability: theory and examples
[5] [Anonymous], LAUR022951 LOS AL NA
[6] [Anonymous], 2013, A Probabilistic Theory of Pattern Recognition
[7] ARLOT S, 2007, LEARNING THEORY, V127, P141
[8] BACH FR, 2006, J MACHINE LEARNING R, P1713
[9] CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING
BENJAMINI, Y
HOCHBERG, Y
[J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) : 289 - 300
[10] BEYGELZIMER A, 2009, ERROR CORRECTING TOU

← 1 2 3 4 →