What's inside the black-box? A genetic programming method for interpreting complex machine learning models

被引:30
作者
Evans, Benjamin P. [1 ]
Xue, Bing [1 ]
Zhang, Mengjie [1 ]
机构
[1] Victoria Univ Wellington, Sch Engn & Comp Sci, Wellington, New Zealand
来源
PROCEEDINGS OF THE 2019 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'19) | 2019年
关键词
Explainable Artificial Intelligence; Interpretable Machine Learning; Evolutionary Multi-objective Optimisation;
D O I
10.1145/3321707.3321726
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Interpreting state-of-the-art machine learning algorithms can be difficult. For example, why does a complex ensemble predict a particular class? Existing approaches to interpretable machine learning tend to be either local in their explanations, apply only to a particular algorithm, or overly complex in their global explanations. In this work, we propose a global model extraction method which uses multi-objective genetic programming to construct accurate, simplistic and model-agnostic representations of complex black-box estimators. We found the resulting representations are far simpler than existing approaches while providing comparable reconstructive performance. This is demonstrated on a range of datasets, by approximating the knowledge of complex black-box models such as 200 layer neural networks and ensembles of 500 trees, with a single tree.
引用
收藏
页码:1012 / 1020
页数:9
相关论文
共 36 条
[1]  
[Anonymous], 2016, ARXIV160208610
[2]  
[Anonymous], 2017, ARXIV170609773
[3]  
[Anonymous], 2018, CORR
[4]  
[Anonymous], 1993, MULTIINTERVAL DISCRE
[5]  
[Anonymous], 2012, ESANN 2012 P 20 EURO
[6]  
[Anonymous], 2010, Deconvolutional networks
[7]   Evolution strategies – A comprehensive introduction [J].
Hans-Georg Beyer ;
Hans-Paul Schwefel .
Natural Computing, 2002, 1 (1) :3-52
[8]  
Bolukbasi T., 2016, Advances in Neural Information Processing Systems (NeurIPS), P4349, DOI DOI 10.5555/3157382.3157584
[9]   Business data mining - a machine learning perspective [J].
Bose, I ;
Mahapatra, RK .
INFORMATION & MANAGEMENT, 2001, 39 (03) :211-225
[10]  
Buciluaˇ C., 2006, P 12 ACM SIGKDD INT, P535, DOI DOI 10.1145/1150402.1150464