mSHAP: SHAP Values for Two-Part Models

被引:11
作者
Matthews, Spencer [1 ]
Hartman, Brian [2 ]
机构
[1] Univ Calif Irvine, Dept Stat, Donald Bren Sch Informat & Comp Sci, Irvine, CA 92697 USA
[2] Brigham Young Univ, Coll Phys & Math Sci, Dept Stat, Provo, UT 84602 USA
关键词
explainability; machine learning; ratemaking;
D O I
10.3390/risks10010003
中图分类号
F8 [财政、金融];
学科分类号
0202 ;
摘要
Two-part models are important to and used throughout insurance and actuarial science. Since insurance is required for registering a car, obtaining a mortgage, and participating in certain businesses, it is especially important that the models that price insurance policies are fair and non-discriminatory. Black box models can make it very difficult to know which covariates are influencing the results, resulting in model risk and bias. SHAP (SHapley Additive exPlanations) values enable interpretation of various black box models, but little progress has been made in two-part models. In this paper, we propose mSHAP (or multiplicative SHAP), a method for computing SHAP values of two-part models using the SHAP values of the individual models. This method will allow for the predictions of two-part models to be explained at an individual observation level. After developing mSHAP, we perform an in-depth simulation study. Although the kernelSHAP algorithm is also capable of computing approximate SHAP values for a two-part model, a comparison with our method demonstrates that mSHAP is exponentially faster. Ultimately, we apply mSHAP to a two-part ratemaking model for personal auto property damage insurance coverage. Additionally, an R package (mshap) is available to easily implement the method in a wide variety of applications.
引用
收藏
页数:23
相关论文
共 18 条
[1]  
Ablad Mouad., 2020 6 IEEE C INF SC, P110
[2]   Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI [J].
Barredo Arrieta, Alejandro ;
Diaz-Rodriguez, Natalia ;
Del Ser, Javier ;
Bennetot, Adrien ;
Tabik, Siham ;
Barbado, Alberto ;
Garcia, Salvador ;
Gil-Lopez, Sergio ;
Molina, Daniel ;
Benjamins, Richard ;
Chatila, Raja ;
Herrera, Francisco .
INFORMATION FUSION, 2020, 58 :82-115
[3]  
Besold T. R., 2017, arXiv preprint arXiv:1710.00794
[4]   HOUSEHOLD LIFE INSURANCE DEMAND: A MULTIVARIATE TWO-PART MODEL [J].
Frees, Edward W. ;
Sun, Yunjie .
NORTH AMERICAN ACTUARIAL JOURNAL, 2010, 14 (03) :338-354
[5]   A Priori Ratemaking Selection Using Multivariate Regression Models Allowing Different Coverages in Auto Insurance [J].
Gomez-Deniz, Emilio ;
Calderin-Ojeda, Enrique .
RISKS, 2021, 9 (07)
[6]  
Gunning D., 2017, Explainable artificial intelligence (XAI)
[7]  
H2O.ai, 2021, H2O R PACK VERS 3 34
[8]   An application of two-stage quantile regression to insurance ratemaking [J].
Heras, Antonio ;
Moreno, Ignacio ;
Vilar-Zanon, Jose L. .
SCANDINAVIAN ACTUARIAL JOURNAL, 2018, (09) :753-769
[9]  
Kemi Akinyemi, USE ADV PREDICTIVE A
[10]   A novel varistructure grey forecasting model with speed adaptation and its application [J].
Li, Shoujun ;
Miao, Yanzi ;
Li, Guangyu ;
Ikram, Muhammad .
MATHEMATICS AND COMPUTERS IN SIMULATION, 2020, 172 :45-70