Generalized pareto regression trees for extreme event analysis

被引:0
作者
Farkas, Sebastien [1 ]
Heranval, Antoine [1 ,2 ,3 ]
Lopez, Olivier [1 ,3 ]
Thomas, Maud [1 ]
机构
[1] Sorbonne Univ, CNRS, Lab Probabil Stat & Modelisat, 4 Pl Jussieu, F-75005 Paris, France
[2] Mission Risques Nat, 1 rue Jules Lefebvre, F-75009 Paris, France
[3] CNRS, Ecole Polytech, Inst Polytech Paris, CREST Lab,Grp Ecoles Natl Econ & Stat, 5 Ave Henry Chatelier, F-91120 Palaiseau, France
关键词
Extreme value theory; Regression trees; Concentration inequalities; Generalized pareto distribution; CONDITIONAL QUANTILES; CLASSIFICATION; MODELS; CONSISTENCY; SELECTION;
D O I
10.1007/s10687-024-00485-1
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
This paper derives finite sample results to assess the consistency of Generalized Pareto regression trees introduced by Farkas et al. (Insur. Math. Econ. 98:92-105, 2021) as tools to perform extreme value regression for heavy-tailed distributions. This procedure allows the constitution of classes of observations with similar tail behaviors depending on the value of the covariates, based on a recursive partition of the sample and simple model selection rules. The results we provide are obtained from concentration inequalities, and are valid for a finite sample size. A misspecification bias that arises from the use of a "Peaks over Threshold" approach is also taken into account. Moreover, the derived properties legitimate the pruning strategies, that is the model selection rules, used to select a proper tree that achieves a compromise between simplicity and goodness-of-fit. The methodology is illustrated through a simulation study, and a real data application in insurance for natural disasters.
引用
收藏
页码:437 / 477
页数:41
相关论文
共 46 条
  • [1] RELATIONSHIP BETWEEN VARIABLE SELECTION AND DATA AUGMENTATION AND A METHOD FOR PREDICTION
    ALLEN, DM
    [J]. TECHNOMETRICS, 1974, 16 (01) : 125 - 127
  • [2] Allouche M., 2022, ESTIMATION OF EXTREM
  • [3] [Anonymous], 2016, VOLUMETOWEIGHT CONVE
  • [4] RESIDUAL LIFE TIME AT GREAT AGE
    BALKEMA, AA
    DEHAAN, L
    [J]. ANNALS OF PROBABILITY, 1974, 2 (05) : 792 - 804
  • [5] A penalised piecewise-linear model for non-stationary extreme value analysis of peaks over threshold
    Barlow, Anna Maria
    Mackay, Ed
    Eastoe, Emma
    Jonathan, Philip
    [J]. OCEAN ENGINEERING, 2023, 267
  • [6] Local polynomial maximum likelihood estimation for Pareto-type distributions
    Beirlant, J
    Goegebeur, Y
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2004, 89 (01) : 97 - 118
  • [7] Beirlant J., 2004, STAT EXTREMES THEORY, DOI DOI 10.1002/0470012382
  • [8] SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation
    Blewitt, Marnie E.
    Gendrel, Anne-Valerie
    Pang, Zhenyi
    Sparrow, Duncan B.
    Whitelaw, Nadia
    Craig, Jeffrey M.
    Apedaile, Anwyn
    Hilton, Douglas J.
    Dunwoodie, Sally L.
    Brockdorff, Neil
    Kay, Graham F.
    Whitelaw, Emma
    [J]. NATURE GENETICS, 2008, 40 (05) : 663 - 669
  • [9] Stochastic downscaling of precipitation with neural network conditional mixture models
    Carreau, Julie
    Vrac, Mathieu
    [J]. WATER RESOURCES RESEARCH, 2011, 47
  • [10] Insurance against natural catastrophes: balancing actuarial fairness and social solidarity
    Charpentier, Arthur
    Barry, Laurence
    James, Molly R.
    [J]. GENEVA PAPERS ON RISK AND INSURANCE-ISSUES AND PRACTICE, 2022, 47 (01) : 50 - 78