Explaining the Performance of Black Box Regression Models

Cited by: 4
Authors
Areosa, Ines [1 ]
Torgo, Luis [2 ]
Affiliations
[1] INESC TEC, LIAAD, Porto, Portugal
[2] Dalhousie Univ, Fac Comp Sci, Halifax, NS, Canada
Source
2019 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA 2019) | 2019
Keywords
DOI
10.1109/DSAA.2019.00025
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline code
0812
Abstract
The widespread use of Machine Learning and Data Mining models in several key areas of our societies has raised serious concerns about accountability and the ability to justify and interpret the decisions of these models. This is even more relevant when models are highly complex and often regarded as black boxes. In this paper we present several tools designed to help understand and explain the reasons for the observed predictive performance of black box regression models. We describe, evaluate and propose several variants of Error Dependence Plots. These plots provide a visual display of the expected relationship between the prediction error of any model and the values of a predictor variable. They allow the end user to understand what to expect from the models given some concrete values of the predictor variables. These tools allow more accurate explanations of the conditions that may lead to failures of the models. Moreover, our proposed extensions also provide a multivariate perspective of this analysis, and the ability to compare the behaviour of multiple models under different conditions. This comparative analysis empowers the end user to carry out a case-based analysis of the risks associated with different models, and thus to select the model with the lowest expected risk for each test case, or even to decide not to use any model because the expected error is unacceptable.
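The following is a minimal sketch of the idea behind an Error Dependence Plot, assuming Python with scikit-learn, pandas and matplotlib; the dataset (California housing), the random forest model, the feature "MedInc" and the decile binning are illustrative choices, not the authors' setup or implementation.

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
from sklearn.datasets import fetch_california_housing
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import train_test_split

# Illustrative data and model; any regression model and dataset could be used.
X, y = fetch_california_housing(return_X_y=True, as_frame=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

model = RandomForestRegressor(n_estimators=100, random_state=0).fit(X_train, y_train)

# Absolute prediction error on held-out cases.
abs_err = np.abs(model.predict(X_test) - y_test)

# Bin one predictor into deciles and show the mean absolute error per bin,
# i.e. how the expected error depends on the values of that predictor.
feature = "MedInc"
bins = pd.qcut(X_test[feature], q=10)
err_by_bin = abs_err.groupby(bins).mean()

err_by_bin.plot(kind="bar")
plt.xlabel(feature)
plt.ylabel("Mean absolute error")
plt.title("Error dependence on %s (illustrative)" % feature)
plt.tight_layout()
plt.show()

Repeating the per-bin error computation for several models on the same predictor gives the kind of comparative, case-based view of expected risk that the abstract describes.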
Pages: 110-118
Number of pages: 9