Multicollinearity Correction and Combined Feature Effect in Shapley Values

被引：7

作者：

Basu, Indranil ^{[1
]}

Maji, Subhadip ^{[2
]}

机构：

[1] Optum Global Solut, Hyderabad, India

[2] Optum Global Solut, Bangalore, Karnataka, India

来源：

AI 2021: ADVANCES IN ARTIFICIAL INTELLIGENCE | 2022年 / 13151卷

关键词：

Model interpretation; Multicollinearity; Feature extraction; Shapley values;

D O I：

10.1007/978-3-030-97546-3_7

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Model interpretability is one of the most intriguing problems in most machine learning models, particularly for those that are mathematically sophisticated. Computing Shapley Values are one of the best approaches so far to find the importance of each feature in a model, at the instance (data point) level. In other words, Shapley values represent the importance of a feature for a particular instance or observation, especially for classification or regression problems. One of the well known limitations of Shapley values is that the estimation of Shapley values with the presence of multicollinearity among the features are not accurate as well as reliable. To address this problem, we present a unified framework to calculate accurate Shapley values with correlated features. To be more specific, we do an adjustment (matrix formulation) of the features while calculating independent Shapley values for the instances to make the features independent with each other. Our implementation of this method proves that our method is computationally efficient also, compared to the existing Shapley method.

引用

页码：79 / 90

页数：12

共 9 条

[1] Explaining individual predictions when features are dependent: More accurate approximations to Shapley values [J].

Aas, Kjersti ;

Jullum, Martin ;

Loland, Anders .

ARTIFICIAL INTELLIGENCE, 2021, 298

[2] Feature selection via coalitional game theory [J].

Cohen, Shay ;

Dror, Gideon ;

Ruppin, Eytan .

NEURAL COMPUTATION, 2007, 19 (07) :1939-1961

[3]

Kaggle, 2016, HOUS PRIC ADV REGR T

[4]

Lundberg SM, 2017, ADV NEUR IN, V30

[5]

Molnar C., 2019, MODEL AGNOSTIC METHO

[6] Evaluating Tests in Medical Diagnosis: Combining Machine Learning with Game-Theoretical Concepts [J].

Pfannschmidt, Karlson ;

Huellermeier, Eyke ;

Held, Susanne ;

Neiger, Reto .

INFORMATION PROCESSING AND MANAGEMENT OF UNCERTAINTY IN KNOWLEDGE-BASED SYSTEMS, IPMU 2016, PT I, 2016, 610 :450-461

[7]

Shapley L. S., 1953, CONTRIBUTIONS THEORY, V28, P307, DOI DOI 10.1515/9781400881970-018

[8] Explaining prediction models and individual predictions with feature contributions [J].

Strumbelj, Erik ;

Kononenko, Igor .

KNOWLEDGE AND INFORMATION SYSTEMS, 2014, 41 (03) :647-665

[9]

Sundararajan M., 2019, INT C MACHINE LEARNI

← 1 →