Extract interpretability-accuracy balanced rules from artificial neural networks: A review

被引：62

作者：

He, Congjie ^{[1
]}

Ma, Meng ^{[2
]}

Wang, Ping ^{[1
,2
,3
]}

机构：

[1] Peking Univ, Sch Software & Microelect, Beijing 102600, Peoples R China

[2] Peking Univ, Natl Engn Res Ctr Software Engn, Beijing 100871, Peoples R China

[3] Minist Educ, Key Lab High Confidence Software Technol PKU, Beijing, Peoples R China

来源：

NEUROCOMPUTING | 2020年 / 387卷 / 387期

基金：

国家重点研发计划; 中国国家自然科学基金;

关键词：

Rule extraction; Accuracy; Interpretability; Multilayer Perceptron; Deep neural network; CLASSIFICATION PROBLEMS; DECISION RULES; INDUCTION; ISSUES; TREE;

D O I：

10.1016/j.neucom.2020.01.036

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Artificial neural networks (ANN) have been widely used and have achieved remarkable achievements. However, neural networks with high accuracy and good performance often have extremely complex internal structures such as deep neural networks (DNN). This shortcoming makes the neural networks as incomprehensible as a black box, which is unacceptable in some practical applications. But pursuing excessive interpretation of the neural networks will make the performance of the model worse. Based on this contradictory issue, we first summarize the mainstream methods about quantitatively evaluating the accuracy and interpretability of rule set. And then review existing methods on extracting rules from Multilayer Perceptron (MLP) and DNN in three categories: Decomposition Approach (Extract rules in neuron level such as visualizing the structure of network), Pedagogical Approach (By studying the correspondence between input and output such as by computing gradient) and Eclectics Approach (Combine the above two ideas). Some potential research directions about extracting rules from DNN are discussed in the last. (C) 2020 Elsevier B.V. All rights reserved.

引用

页码：346 / 358

页数：13

共 95 条

[21] Cameron AC, 1997, J ECONOMETRICS, V77, P329
[22] Intelligible Models for HealthCare: Predicting Pneumonia Risk and Hospital 30-day Readmission
Caruana, Rich
Lou, Yin
Gehrke, Johannes
Koch, Paul
Sturm, Marc
Elhadad, Noemie
[J]. KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, : 1721 - 1730
[23] Che Zhengping, 2016, AMIA Annu Symp Proc, V2016, P371
[24] Stochastic stability for distributed delay neural networks via augmented Lyapunov-Krasovskii functionals
Chen, Yonggang
Wang, Zidong
Liu, Yurong
Alsaadi, Fuad E.
[J]. APPLIED MATHEMATICS AND COMPUTATION, 2018, 338 : 869 - 881
[25] Further results on passivity analysis of delayed neural networks with leakage delay
Chen, Yonggang
Fu, Zhumu
Liu, Yurong
Alsaadi, Fuad E.
[J]. NEUROCOMPUTING, 2017, 224 : 135 - 141
[26] A Pareto-based multi-objective evolutionary approach to the identification of Mamdani fuzzy systems
Cococcioni, Marco
Ducange, Pietro
Lazzerini, Beatrice
Marcelloni, Francesco
[J]. SOFT COMPUTING, 2007, 11 (11) : 1013 - 1031
[27] A COEFFICIENT OF AGREEMENT FOR NOMINAL SCALES
COHEN, J
[J]. EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1960, 20 (01) : 37 - 46
[28] Visualizing and Understanding Neural Machine Translation
Ding, Yanzhuo
Liu, Yang
Luan, Huanbo
Sun, Maosong
[J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 1150 - 1159
[29] Inverting Visual Representations with Convolutional Networks
Dosovitskiy, Alexey
Brox, Thomas
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 4829 - 4837
[30] Support vector machines for spam categorization
Drucker, H
Wu, DH
Vapnik, VN
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1999, 10 (05): : 1048 - 1054

← 1 2 3 4 5 6 7 8 9 10 →