PAC-Bayesian AUC classification and scoring

被引:0
作者
Ridgway, James [1 ,2 ]
Alquier, Pierre [3 ]
Chopin, Nicolas [3 ,4 ]
Liang, Feng [5 ]
机构
[1] Univ Dauphine, CREST, Paris, France
[2] Univ Dauphine, CEREMADE, Paris, France
[3] ENSAE, CREST, Paris, France
[4] HEC Paris, Paris, France
[5] Univ Illinois, Champaign, IL USA
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014) | 2014年 / 27卷
关键词
VARIABLE SELECTION; BOUNDS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We develop a scoring and classification procedure based on the PAC-Bayesian approach and the AUC (Area Under Curve) criterion. We focus initially on the class of linear score functions. We derive PAC-Bayesian non-asymptotic bounds for two types of prior for the score parameters: a Gaussian prior, and a spike-and-slab prior; the latter makes it possible to perform feature selection. One important advantage of our approach is that it is amenable to powerful Bayesian computational tools. We derive in particular a Sequential Monte Carlo algorithm, as an efficient method which may be used as a gold standard, and an Expectation-Propagation algorithm, as a much faster but approximate method. We also extend our method to a class of non-linear score functions, essentially leading to a nonparametric procedure, by considering a Gaussian process prior.
引用
收藏
页数:9
相关论文
共 26 条
[1]   PAC-Bayesian Bounds for Randomized Empirical Risk Minimizers [J].
Alquier, P. .
MATHEMATICAL METHODS OF STATISTICS, 2008, 17 (04) :279-304
[2]  
Alquier P, 2013, J MACH LEARN RES, V14, P243
[3]  
[Anonymous], 2003, Journal of machine learning research
[4]  
[Anonymous], 2005, EXPECTATION PROPAGAT
[5]  
[Anonymous], THESIS
[6]  
[Anonymous], 2003, INT C MACHINE LEARNI
[7]  
[Anonymous], 2007, IMS LECT NOTES MONOG
[8]  
Buhlmann P., 2011, STAT HIGH DIMENSIONN
[9]   Ranking and empirical minimization of U-statistics [J].
Clemencon, Stephan ;
Lugosi, Gabor ;
Vayatis, Nicolas .
ANNALS OF STATISTICS, 2008, 36 (02) :844-874
[10]  
Clemencon Stephan, 2008, Journal of Biological Dynamics, V2, P392, DOI 10.1080/17513750801993266