Generalized pareto regression trees for extreme event analysis

被引:0
作者
Farkas, Sebastien [1 ]
Heranval, Antoine [1 ,2 ,3 ]
Lopez, Olivier [1 ,3 ]
Thomas, Maud [1 ]
机构
[1] Sorbonne Univ, CNRS, Lab Probabil Stat & Modelisat, 4 Pl Jussieu, F-75005 Paris, France
[2] Mission Risques Nat, 1 rue Jules Lefebvre, F-75009 Paris, France
[3] CNRS, Ecole Polytech, Inst Polytech Paris, CREST Lab,Grp Ecoles Natl Econ & Stat, 5 Ave Henry Chatelier, F-91120 Palaiseau, France
关键词
Extreme value theory; Regression trees; Concentration inequalities; Generalized pareto distribution; CONDITIONAL QUANTILES; CLASSIFICATION; MODELS; CONSISTENCY; SELECTION;
D O I
10.1007/s10687-024-00485-1
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
This paper derives finite sample results to assess the consistency of Generalized Pareto regression trees introduced by Farkas et al. (Insur. Math. Econ. 98:92-105, 2021) as tools to perform extreme value regression for heavy-tailed distributions. This procedure allows the constitution of classes of observations with similar tail behaviors depending on the value of the covariates, based on a recursive partition of the sample and simple model selection rules. The results we provide are obtained from concentration inequalities, and are valid for a finite sample size. A misspecification bias that arises from the use of a "Peaks over Threshold" approach is also taken into account. Moreover, the derived properties legitimate the pruning strategies, that is the model selection rules, used to select a proper tree that achieves a compromise between simplicity and goodness-of-fit. The methodology is illustrated through a simulation study, and a real data application in insurance for natural disasters.
引用
收藏
页码:437 / 477
页数:41
相关论文
共 46 条
  • [11] Chaudhuri P, 2002, BERNOULLI, V8, P561
  • [12] Asymptotic consistency of median regression trees
    Chaudhuri, P
    [J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2000, 91 (02) : 229 - 238
  • [13] An Extreme Value Approach for Modeling Operational Risk Losses Depending on Covariates
    Chavez-Demoulin, Valerie
    Embrechts, Paul
    Hofert, Marius
    [J]. JOURNAL OF RISK AND INSURANCE, 2016, 83 (03) : 735 - 776
  • [14] Extremal quantile regression
    Chernozhukov, V
    [J]. ANNALS OF STATISTICS, 2005, 33 (02) : 806 - 839
  • [15] Coles S., 2001, An Introduction to Statistical Modeling of Extreme Values, DOI [10.1007/978-1-4471-3675-0, DOI 10.1007/978-1-4471-3675-0]
  • [16] DAVISON AC, 1990, J ROY STAT SOC B MET, V52, P393
  • [17] De'ath G, 2000, ECOLOGY, V81, P3178, DOI 10.2307/177409
  • [18] Uniform in bandwidth consistency of kernel-type function estimators
    Einmahl, U
    Mason, DM
    [J]. ANNALS OF STATISTICS, 2005, 33 (03) : 1380 - 1403
  • [19] Embrechts P, 2013, Modelling extremal events: for insurance and finance, V33, DOI DOI 10.1007/978-3-642-33483-2
  • [20] Cyber claim analysis using Generalized Pareto regression trees with applications to insurance
    Farkas, Sebastien
    Lopez, Olivier
    Thomas, Maud
    [J]. INSURANCE MATHEMATICS & ECONOMICS, 2021, 98 : 92 - 105