Generating Adversarial Examples of Source Code Classification Models via Q-Learning-Based Markov Decision Process

被引:5
作者
Tian, Junfeng [1 ]
Wang, Chenxin [1 ]
Li, Zhen [1 ]
Wen, Yu [1 ]
机构
[1] Hebei Univ, Baoding, Peoples R China
来源
2021 IEEE 21ST INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY (QRS 2021) | 2021年
基金
中国国家自然科学基金;
关键词
adversarial examples; adversarial training; robustness; source code; deep learning; SYSTEM;
D O I
10.1109/QRS54544.2021.00090
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Adversarial robustness becomes an essential concern in Deep Learning (DL)-based source code processing, as DL models are vulnerable to the deception by attackers. To address a new challenge posed by the discrete and structural nature of source code to generate adversarial examples for DL models, and the insufficient focus of existing methods on code structural features, we propose a Q-Learning-based Markov decision process (QMDP) performing semantically equivalent transformations on the source code structure. Two key issues are mainly addressed: (i) how to perform attacks on source code structural information and (ii) what transformations to perform when and where in the source code. We demonstrate that effectively tackling these two issues is crucial for generating adversarial examples for source code. By evaluating C/C++ programs working on the source code classification task, we verified that QMDP can effectively generate adversarial examples and improve the robustness of DL models over 44%.
引用
收藏
页码:807 / 818
页数:12
相关论文
共 31 条
[1]  
Allamanis M, 2016, PR MACH LEARN RES, V48
[2]  
Alzantot M, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P2890
[3]  
[Anonymous], 2014, Learning to execute
[4]  
Bielik P, 2020, PR MACH LEARN RES, V119
[5]  
Bui NDQ, 2021, AAAI CONF ARTIF INTE, V35, P30
[6]  
Dai HJ, 2018, PR MACH LEARN RES, V80
[7]  
Duan X, 2019, PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P4665
[8]  
Ebrahimi J, 2018, PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, P31
[9]   Black-box Generation of Adversarial Text Sequences to Evade Deep Learning Classifiers [J].
Gao, Ji ;
Lanchantin, Jack ;
Soffa, Mary Lou ;
Qi, Yanjun .
2018 IEEE SYMPOSIUM ON SECURITY AND PRIVACY WORKSHOPS (SPW 2018), 2018, :50-56
[10]  
Goodfellow I. J., 2015, P 3 INT CONFER ENCE