A Simple Convolutional Neural Network with Rule Extraction

被引：21

作者：

Bologna, Guido ^{[1
,2
]}

机构：

[1] Univ Appl Sci & Arts Western Switzerland, Dept Comp Sci, Rue Prairie 4, CH-1202 Geneva, Switzerland

[2] Univ Appl Sci & Arts Western Switzerland, Rue Prairie 4, CH-1202 Geneva, Switzerland

来源：

APPLIED SCIENCES-BASEL | 2019年 / 9卷 / 12期

关键词：

CNN; model explanation; rule extraction; sentiment analysis; n-grams;

D O I：

10.3390/app9122411

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

Classification responses provided by Multi Layer Perceptrons (MLPs) can be explained by means of propositional rules. So far, many rule extraction techniques have been proposed for shallow MLPs, but not for Convolutional Neural Networks (CNNs). To fill this gap, this work presents a new rule extraction method applied to a typical CNN architecture used in Sentiment Analysis (SA). We focus on the textual data on which the CNN is trained with tweets of movie reviews. Its architecture includes an input layer representing words by word embeddings, a convolutional layer, a max-pooling layer, followed by a fully connected layer. Rule extraction is performed on the fully connected layer, with the help of the Discretized Interpretable Multi Layer Perceptron (DIMLP). This transparent MLP architecture allows us to generate symbolic rules, by precisely locating axis-parallel hyperplanes. Experiments based on cross-validation emphasize that our approach is more accurate than that based on SVMs and decision trees that substitute DIMLPs. Overall, rules reach high fidelity and the discriminative n-grams represented in the antecedents explain the classifications adequately. With several test examples we illustrate the n-grams represented in the activated rules. They present the particularity to contribute to the final classification with a certain intensity.

引用

页数：23

共 48 条

[1] Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI) [J].

Adadi, Amina ;

Berrada, Mohammed .

IEEE ACCESS, 2018, 6 :52138-52160

[2] Survey and critique of techniques for extracting rules from trained artificial neural networks [J].

Andrews, R ;

Diederich, J ;

Tickle, AB .

KNOWLEDGE-BASED SYSTEMS, 1995, 8 (06) :373-389

[3]

[Anonymous], 1998, STAT LEARNING THEORY

[4]

[Anonymous], P 3 INT C LEARNING R

[5]

[Anonymous], 2018, stat

[6]

[Anonymous], 2018, ADV NEURAL INFORM PR

[7]

[Anonymous], 2018, ABS180200121 CORR

[8]

[Anonymous], 2017, ARXIV171209923

[9]

[Anonymous], 2017, COMMUN ACM, DOI DOI 10.1145/3065386

[10]

[Anonymous], 2017, ARXIV170304730

← 1 2 3 4 5 →