Motor insurance claim modelling with factor collapsing and Bayesian model averaging

Cited by: 5
Authors
Hu, Sen [1, 2]
O'Hagan, Adrian [1, 2]
Murphy, Thomas Brendan [1, 2]
Affiliations
[1] Univ Coll Dublin, Sch Math & Stat, Dublin 4, Ireland
[2] Univ Coll Dublin, Insight Ctr Data Analyt, Dublin 4, Ireland
Source
STAT | 2018 / Vol. 7 / Issue 1
Funding
Science Foundation Ireland
Keywords
Bayesian model averaging; categorical variable selection; clustering; factor collapsing; general insurance pricing; generalized linear model; GRAPHICAL MODELS; REGRESSION; SELECTION; UNCERTAINTY
DOI
10.1002/sta4.180
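Chinese Library Classification (CLC) number
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics]
Discipline classification code
020208; 070103; 0714
Abstract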
While generalized linear models have become the insurance industry's standard approach for claim modelling, the approach of utilizing a single best model on which predictions are based ignores model selection uncertainty. An additional feature of insurance claim data sets is the common presence of categorical variables, within which the number of levels is high, and not all levels may be statistically significant. In such cases, some subsets of the levels may be merged to give a smaller overall number of levels for improved model parsimony and interpretability. Hence, clustering of the levels poses an additional model uncertainty issue. A method is proposed for assessing the optimal manner of collapsing factors with many levels into factors with smaller numbers of levels, and Bayesian model averaging is used to blend model predictions from all reasonable models to account for selection uncertainty. This method will be computationally intensive when the number of factors being collapsed or the number of levels within factors increases. Hence, a stochastic approach is used to quickly identify the best collapsing cases across the model space. Copyright (c) 2018 John Wiley & Sons, Ltd.
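The idea in the abstract can be illustrated with a minimal sketch, which is not the authors' implementation. It assumes a single categorical rating factor, a Poisson claim-frequency model with an exposure offset (so the fitted rate of each merged group is simply claims divided by exposure), and BIC-based approximate posterior model weights. The abstract does not confirm any of these choices. The toy data and the names `set_partitions`, `poisson_loglik`, `bma_over_collapsings` and `n_obs` are hypothetical. The sketch enumerates every collapsing exhaustively, which is only feasible for a handful of levels; the paper instead uses a stochastic search for larger model spaces.

```python
import math


def set_partitions(items):
    """Yield every way of grouping `items` into non-empty groups."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for part in set_partitions(rest):
        # Merge `first` into each existing group in turn ...
        for i in range(len(part)):
            yield part[:i] + [[first] + part[i]] + part[i + 1:]
        # ... or leave it as a group of its own.
        yield [[first]] + part


def poisson_loglik(y, e, rate):
    """Poisson log-likelihood of y claims observed over exposure e."""
    mean = rate * e
    log_term = y * math.log(mean) if y > 0 else 0.0
    return log_term - mean - math.lgamma(y + 1)


def bma_over_collapsings(data, n_obs):
    """Fit every collapsing of the factor levels and average the fitted rates.

    data  : {level: (claims, exposure)}, aggregated per level
    n_obs : sample size used in the BIC penalty (an assumption of this sketch)
    """
    levels = sorted(data)
    models = []
    for partition in set_partitions(levels):
        rates = {}
        for group in partition:
            y = sum(data[l][0] for l in group)
            e = sum(data[l][1] for l in group)
            for l in group:
                rates[l] = y / e  # closed-form MLE for a merged group
        loglik = sum(poisson_loglik(*data[l], rates[l]) for l in levels)
        bic = -2.0 * loglik + len(partition) * math.log(n_obs)
        models.append((partition, rates, bic))

    # BIC approximation to posterior model probabilities.
    best_bic = min(m[2] for m in models)
    raw = [math.exp(-0.5 * (m[2] - best_bic)) for m in models]
    total = sum(raw)
    weights = [r / total for r in raw]

    # Model-averaged claim rate for each original level.
    averaged = {
        l: sum(w * m[1][l] for w, m in zip(weights, models)) for l in levels
    }
    return models, weights, averaged


# Hypothetical toy data: claims and policy-years for four factor levels.
toy = {
    "A": (50, 1000.0),
    "B": (52, 1000.0),
    "C": (120, 1000.0),
    "D": (118, 1000.0),
}
models, weights, averaged = bma_over_collapsings(toy, n_obs=4000)
best_idx = max(range(len(weights)), key=weights.__getitem__)
best_partition = models[best_idx][0]
print("highest-weight collapsing:", best_partition)
print("model-averaged rates:", averaged)
```

The averaged rates blend the fitted values of all candidate collapsings in proportion to their weights, so levels whose merging is only weakly supported still retain some of their individual signal.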
Pages: 21