Adversarial Learning Games with Deep Learning Models

Citations: 0
Authors
Chivukula, Aneesh Sreevallabh [1 ]
Liu, Wei [1 ]
Affiliations
[1] Univ Technol Sydney, Adv Analyt Inst, Sydney, NSW, Australia
Source
2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2017
Keywords
Supervised learning; Data mining and knowledge discovery; Evolutionary learning; Adversarial learning; Deep learning; Genetic algorithms; Game theory;
DOI
Not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Discipline Codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Deep learning has been found to be vulnerable to changes in the data distribution: inputs that differ imperceptibly from the training data can be assigned a completely different class label. Thus an existing deep learning network such as a Convolutional Neural Network (CNN) is vulnerable to adversarial examples. We design an adversarial learning algorithm for supervised learning in general and CNNs in particular. Adversarial examples are generated by a game-theoretic formulation on the performance of deep learning. In the game, the interaction between an intelligent adversary and the deep learning model is a two-person sequential noncooperative Stackelberg game with stochastic payoff functions. The Stackelberg game is solved for a Nash equilibrium, a pair of strategies (learner weights and genetic operations) from which neither the learner nor the adversary has an incentive to deviate. The algorithm's performance is evaluated under different strategy spaces on the MNIST handwritten digits data. We show that the Nash equilibrium leads to solutions that are robust to subsequent adversarial data manipulations. The results suggest that game theory and stochastic optimization algorithms can be used to study performance vulnerabilities in deep learning models.
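The game loop described in the abstract can be sketched in miniature. The following is a hypothetical illustration, not the authors' implementation: it substitutes a logistic-regression learner on synthetic two-class data for the paper's CNN-on-MNIST setup, and a toy genetic algorithm (selection, crossover, mutation over bounded perturbations) for the paper's genetic operations. The leader (learner) trains, the follower (adversary) evolves a perturbation that maximizes the learner's loss, and the learner then retrains on the perturbed data — alternating until neither move helps much, a crude proxy for the equilibrium pair of strategies.

```python
# Hypothetical sketch of the sequential Stackelberg game from the abstract.
# Assumptions (not from the paper): logistic-regression learner, synthetic
# 5-feature two-class data, and a toy genetic algorithm as the adversary.
import numpy as np

rng = np.random.default_rng(0)

# Synthetic two-class data standing in for MNIST features.
X = np.vstack([rng.normal(-1, 1, (100, 5)), rng.normal(1, 1, (100, 5))])
y = np.array([0] * 100 + [1] * 100)

def train(X, y, epochs=200, lr=0.1):
    """Leader move: fit logistic-regression weights by gradient descent."""
    w = np.zeros(X.shape[1])
    for _ in range(epochs):
        p = 1 / (1 + np.exp(-X @ w))
        w -= lr * X.T @ (p - y) / len(y)
    return w

def attack(X, y, w, pop=20, gens=30, eps=0.5):
    """Follower move: genetic search for a bounded perturbation delta
    that maximizes the learner's cross-entropy loss on X + delta."""
    def loss(d):
        p = 1 / (1 + np.exp(-(X + d) @ w))
        return -np.mean(y * np.log(p + 1e-9) + (1 - y) * np.log(1 - p + 1e-9))
    deltas = rng.uniform(-eps, eps, (pop, X.shape[1]))
    for _ in range(gens):
        fit = np.array([loss(d) for d in deltas])
        top = deltas[np.argsort(fit)[-pop // 2:]]              # selection
        kids = (top[rng.integers(0, len(top), pop // 2)]
                + top[rng.integers(0, len(top), pop // 2)]) / 2  # crossover
        kids += rng.normal(0, 0.05, kids.shape)                # mutation
        deltas = np.clip(np.vstack([top, kids]), -eps, eps)
    return max(deltas, key=loss)

# Sequential play: adversary perturbs, learner retrains on augmented data.
w = train(X, y)
for _ in range(3):
    delta = attack(X, y, w)
    w = train(np.vstack([X, X + delta]), np.hstack([y, y]))

acc = np.mean((1 / (1 + np.exp(-X @ w)) > 0.5) == y)
```

After a few rounds of play, `w` has been hardened against the adversary's best perturbation in this toy strategy space; in the paper's setting the same alternation runs over CNN weights and genetic operations on MNIST images.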
Pages: 2758-2767
Number of pages: 10