Motor insurance claim modelling with factor collapsing and Bayesian model averaging

被引:5
作者
Hu, Sen [1 ,2 ]
O'Hagan, Adrian [1 ,2 ]
Murphy, Thomas Brendan [1 ,2 ]
机构
[1] Univ Coll Dublin, Sch Math & Stat, Dublin 4, Ireland
[2] Univ Coll Dublin, Insight Ctr Data Analyt, Dublin 4, Ireland
来源
STAT | 2018年 / 7卷 / 01期
基金
爱尔兰科学基金会;
关键词
Bayesian model averaging; categorical variable selection; clustering; factor collapsing; general insurance pricing; generalized linear model; GRAPHICAL MODELS; REGRESSION; SELECTION; UNCERTAINTY;
D O I
10.1002/sta4.180
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
While generalized linear models have become the insurance industry's standard approach for claim modelling, the approach of utilizing a single best model on which predictions are based ignores model selection uncertainty. An additional feature of insurance claim data sets is the common presence of categorical variables, within which the number of levels is high, and not all levels may be statistically significant. In such cases, some subsets of the levels may be merged to give a smaller overall number of levels for improved model parsimony and interpretability. Hence, clustering of the levels poses an additional model uncertainty issue. A method is proposed for assessing the optimal manner of collapsing factors with many levels into factors with smaller numbers of levels, and Bayesian model averaging is used to blend model predictions from all reasonable models to account for selection uncertainty. This method will be computationally intensive when the number of factors being collapsed or the number of levels within factors increases. Hence, a stochastic approach is used to quickly identify the best collapsing cases across the model space. Copyright (c) 2018 John Wiley & Sons, Ltd.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] Semiparametric model averaging prediction: a Bayesian approach
    Wang, Jingli
    Li, Jialiang
    AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 2018, 60 (04) : 407 - 422
  • [22] Evaluating Predictors of Dispersion: A Comparison of Dominance Analysis and Bayesian Model Averaging
    Shou, Yiyun
    Smithson, Michael
    PSYCHOMETRIKA, 2015, 80 (01) : 236 - 256
  • [23] Endogeneity and panel data in growth regressions: A Bayesian model averaging approach
    Leon-Gonzalez, Roberto
    Montolio, Daniel
    JOURNAL OF MACROECONOMICS, 2015, 46 : 23 - 39
  • [24] A study of uncertainties in groundwater vulnerability modelling using Bayesian model averaging (BMA)
    Gharekhani, Maryam
    Nadiri, Ata Allah
    Khatibi, Rahman
    Sadeghfam, Sina
    Moghaddam, Asghar Asghari
    JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2022, 303
  • [25] Bayesian model averaging: A tutorial
    Hoeting, JA
    Madigan, D
    Raftery, AE
    Volinsky, CT
    STATISTICAL SCIENCE, 1999, 14 (04) : 382 - 401
  • [26] Modelling tree height-diameter allometry of Chinese fir in relation to stand and climate variables through Bayesian model averaging approach
    Lu, Lele
    Chhin, Sophan
    Zhang, Jianguo
    Zhang, Xiongqing
    SILVA FENNICA, 2021, 55 (02)
  • [27] Identifying groups of determinants in Bayesian model averaging using Dirichlet process clustering
    Grun, Bettina
    Hofmarcher, Paul
    SCANDINAVIAN JOURNAL OF STATISTICS, 2021, 48 (03) : 1018 - 1045
  • [28] Comparative Risk Assessment for Fossil Energy Chains Using Bayesian Model Averaging
    Spada, Matteo
    Burgherr, Peter
    ENERGIES, 2020, 13 (02)
  • [29] Credal Model Averaging: An Extension of Bayesian Model Averaging to Imprecise Probabilities
    Corani, Giorgio
    Zaffalon, Marco
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PART I, PROCEEDINGS, 2008, 5211 : 257 - 271
  • [30] Sewer Structural Condition Prediction Integrating Bayesian Model Averaging with Logistic Regression
    Kabir, Golam
    Balek, Ngandu Balekelay Celestin
    Tesfamariam, Solomon
    JOURNAL OF PERFORMANCE OF CONSTRUCTED FACILITIES, 2018, 32 (03)