Revealing the structure of prediction models through feature interaction detection

被引:3
作者
Zhang, Xiaohang [1 ,2 ]
Zhang, Hanying [1 ]
Zhu, Ji [3 ]
Li, Zhengren [4 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Econ & Management, Beijing, Peoples R China
[2] Beijing Univ Posts & Telecommun, Key Lab Trustworthy Distributed Comp & Serv, Beijing, Peoples R China
[3] Univ Michigan, Dept Stat, Ann Arbor, MI 48109 USA
[4] Beijing Univ Posts & Telecommun, Sch Modern Post, Beijing, Peoples R China
关键词
Feature interaction; High dimensional model representation; Model interpretation; Black box; SELECTION;
D O I
10.1016/j.knosys.2021.107737
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, machine learning models have been employed for prediction in various domains. While the prediction performance has obviously improved, some models have become too complex to understand, and these models are called black-box models. Detecting the feature interactions is a useful technique to gain insight into the structure of black-box models. In this paper, we propose a method based on high dimensional model representation (HDMR) to reveal the structure of prediction models by detecting the feature interactions that are embedded in the models. The HDMR-based method can detect the k-way interactions without any constraints on k and can detect the interactions from both classification and regression models. Moreover, this method is model-agnostic and can detect both global and local interactions. Experiments on some synthetic and real datasets demonstrate that the HDMR-based method can detect feature interactions effectively and improve the model interpretability. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:9
相关论文
共 36 条
[1]   Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI [J].
Barredo Arrieta, Alejandro ;
Diaz-Rodriguez, Natalia ;
Del Ser, Javier ;
Bennetot, Adrien ;
Tabik, Siham ;
Barbado, Alberto ;
Garcia, Salvador ;
Gil-Lopez, Sergio ;
Molina, Daniel ;
Benjamins, Richard ;
Chatila, Raja ;
Herrera, Francisco .
INFORMATION FUSION, 2020, 58 :82-115
[2]   Bankruptcy prediction by generalized additive models [J].
Berg, Daniel .
APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 2007, 23 (02) :129-143
[3]   Intelligible Models for HealthCare: Predicting Pneumonia Risk and Hospital 30-day Readmission [J].
Caruana, Rich ;
Lou, Yin ;
Gehrke, Johannes ;
Koch, Paul ;
Sturm, Marc ;
Elhadad, Noemie .
KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, :1721-1730
[4]   Learning Global Pairwise Interactions with Bayesian Neural Networks [J].
Cui, Tianyu ;
Marttinen, Pekka ;
Kaski, Samuel .
ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 :1087-1094
[5]   Discovering Unexpected Local Nonlinear Interactions in Scientific Black-box Models [J].
Doron, Michael ;
Segev, Idan ;
Shahaf, Dafna .
KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, :425-435
[6]  
Fisher RA., 1970, Breakthroughs in Statistics: Methodology and Distribution, P66, DOI DOI 10.1007/978-1-4612-4380-96
[7]   Greedy function approximation: A gradient boosting machine [J].
Friedman, JH .
ANNALS OF STATISTICS, 2001, 29 (05) :1189-1232
[8]   PREDICTIVE LEARNING VIA RULE ENSEMBLES [J].
Frieman, Jerome H. ;
Popescu, Bogdan E. .
ANNALS OF APPLIED STATISTICS, 2008, 2 (03) :916-954
[9]  
Greenwell B.M, 2018, A Simple and Effective Model-Based Variable Importance Measure
[10]   A Survey of Methods for Explaining Black Box Models [J].
Guidotti, Riccardo ;
Monreale, Anna ;
Ruggieri, Salvatore ;
Turin, Franco ;
Giannotti, Fosca ;
Pedreschi, Dino .
ACM COMPUTING SURVEYS, 2019, 51 (05)