On Tail Decay Rate Estimation of Loss Function Distributions

被引:0
作者
Haxholli, Etrit [1 ]
Lorenzi, Marco [1 ]
机构
[1] Univ Cte Azur, Inria, Epione Res Grp, 2004 Rte Lucioles, F-06902 Valbonne, France
关键词
Extreme Value Theory; Tail Modelling; Peaks-Over-Threshold; Cross-Tail-; Estimation; Model Ranking; EXTREME; SELECTION; CHOICE; INDEX; MODEL; TIME;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The study of loss -function distributions is critical to characterize a model's behaviour on a given machine -learning problem. While model quality is commonly measured by the average loss assessed on a testing set, this quantity does not ascertain the existence of the mean of the loss distribution. Conversely, the existence of a distribution's statistical moments can be verified by examining the thickness of its tails. Cross -validation schemes determine a family of testing loss distributions conditioned on the training sets. By marginalizing across training sets, we can recover the overall (marginal) loss distribution, whose tail -shape we aim to estimate. Small sample -sizes diminish the reliability and efficiency of classical tail -estimation methods like Peaks -OverThreshold, and we demonstrate that this effect is notably significant when estimating tails of marginal distributions composed of conditional distributions with substantial taillocation variability. We mitigate this problem by utilizing a result we prove: under certain conditions, the marginal-distribution's tail -shape parameter is the maximum tail -shape parameter across the conditional distributions underlying the marginal. We label the resulting approach as 'cross -tail estimation (CTE)'. We test CTE in a series of experiments on simulated and real data1, showing the improved robustness and quality of tail estimation as compared to classical approaches.
引用
收藏
页码:1 / 47
页数:47
相关论文
共 27 条
[1]   NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[2]  
Akaike H., 1998, Selected Papers of Hirotugu Akaike, P199, DOI DOI 10.1007/978-1-4612-1694-015
[3]   RELATIONSHIP BETWEEN VARIABLE SELECTION AND DATA AUGMENTATION AND A METHOD FOR PREDICTION [J].
ALLEN, DM .
TECHNOMETRICS, 1974, 16 (01) :125-127
[4]  
[Anonymous], 2007, Springer Series in Operations Research and Financial Engineering
[5]  
Arlot S, 2009, J MACH LEARN RES, V10, P245
[6]   RESIDUAL LIFE TIME AT GREAT AGE [J].
BALKEMA, AA ;
DEHAAN, L .
ANNALS OF PROBABILITY, 1974, 2 (05) :792-804
[7]   ESTIMATION OF INTEGRAL FUNCTIONALS OF A DENSITY [J].
BIRGE, L ;
MASSART, P .
ANNALS OF STATISTICS, 1995, 23 (01) :11-29
[8]  
Burnham K. P., 2007, Model Selection and Multimodel Inference: A Practical Information-Theoretic Approach
[9]  
Cochran W.G., 1963, SAMPLING TECHNIQUES, V3rd ed.
[10]   TAIL ESTIMATES MOTIVATED BY EXTREME VALUE THEORY [J].
DAVIS, R ;
RESNICK, S .
ANNALS OF STATISTICS, 1984, 12 (04) :1467-1487