Extract interpretability-accuracy balanced rules from artificial neural networks: A review

被引:62
作者
He, Congjie [1 ]
Ma, Meng [2 ]
Wang, Ping [1 ,2 ,3 ]
机构
[1] Peking Univ, Sch Software & Microelect, Beijing 102600, Peoples R China
[2] Peking Univ, Natl Engn Res Ctr Software Engn, Beijing 100871, Peoples R China
[3] Minist Educ, Key Lab High Confidence Software Technol PKU, Beijing, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Rule extraction; Accuracy; Interpretability; Multilayer Perceptron; Deep neural network; CLASSIFICATION PROBLEMS; DECISION RULES; INDUCTION; ISSUES; TREE;
D O I
10.1016/j.neucom.2020.01.036
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Artificial neural networks (ANN) have been widely used and have achieved remarkable achievements. However, neural networks with high accuracy and good performance often have extremely complex internal structures such as deep neural networks (DNN). This shortcoming makes the neural networks as incomprehensible as a black box, which is unacceptable in some practical applications. But pursuing excessive interpretation of the neural networks will make the performance of the model worse. Based on this contradictory issue, we first summarize the mainstream methods about quantitatively evaluating the accuracy and interpretability of rule set. And then review existing methods on extracting rules from Multilayer Perceptron (MLP) and DNN in three categories: Decomposition Approach (Extract rules in neuron level such as visualizing the structure of network), Pedagogical Approach (By studying the correspondence between input and output such as by computing gradient) and Eclectics Approach (Combine the above two ideas). Some potential research directions about extracting rules from DNN are discussed in the last. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:346 / 358
页数:13
相关论文
共 95 条
  • [21] Cameron AC, 1997, J ECONOMETRICS, V77, P329
  • [22] Intelligible Models for HealthCare: Predicting Pneumonia Risk and Hospital 30-day Readmission
    Caruana, Rich
    Lou, Yin
    Gehrke, Johannes
    Koch, Paul
    Sturm, Marc
    Elhadad, Noemie
    [J]. KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, : 1721 - 1730
  • [23] Che Zhengping, 2016, AMIA Annu Symp Proc, V2016, P371
  • [24] Stochastic stability for distributed delay neural networks via augmented Lyapunov-Krasovskii functionals
    Chen, Yonggang
    Wang, Zidong
    Liu, Yurong
    Alsaadi, Fuad E.
    [J]. APPLIED MATHEMATICS AND COMPUTATION, 2018, 338 : 869 - 881
  • [25] Further results on passivity analysis of delayed neural networks with leakage delay
    Chen, Yonggang
    Fu, Zhumu
    Liu, Yurong
    Alsaadi, Fuad E.
    [J]. NEUROCOMPUTING, 2017, 224 : 135 - 141
  • [26] A Pareto-based multi-objective evolutionary approach to the identification of Mamdani fuzzy systems
    Cococcioni, Marco
    Ducange, Pietro
    Lazzerini, Beatrice
    Marcelloni, Francesco
    [J]. SOFT COMPUTING, 2007, 11 (11) : 1013 - 1031
  • [27] A COEFFICIENT OF AGREEMENT FOR NOMINAL SCALES
    COHEN, J
    [J]. EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1960, 20 (01) : 37 - 46
  • [28] Visualizing and Understanding Neural Machine Translation
    Ding, Yanzhuo
    Liu, Yang
    Luan, Huanbo
    Sun, Maosong
    [J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 1150 - 1159
  • [29] Inverting Visual Representations with Convolutional Networks
    Dosovitskiy, Alexey
    Brox, Thomas
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 4829 - 4837
  • [30] Support vector machines for spam categorization
    Drucker, H
    Wu, DH
    Vapnik, VN
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1999, 10 (05): : 1048 - 1054