Towards a Robust Deep Neural Network Against Adversarial Texts: A Survey

Cited by: 31
Authors
Wang, Wenqi [1 ,2 ]
Wang, Run [1 ,2 ]
Wang, Lina [1 ,2 ]
Wang, Zhibo [2 ,3 ]
Ye, Aoshuang [1 ,2 ]
Affiliations
[1] Wuhan Univ, Key Lab Aerosp Informat Secur & Trusted Comp, Minist Educ, Wuhan 430072, Hubei, Peoples R China
[2] Wuhan Univ, Sch Cyber Sci & Engn, Wuhan 430072, Hubei, Peoples R China
[3] Zhejiang Univ, Sch Cyber Sci & Technol, Hangzhou 310027, Zhejiang, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Task analysis; Perturbation methods; Natural language processing; Robustness; Information integrity; Analytical models; Sentiment analysis; Adversarial attacks and defenses; adversarial texts; robustness; deep neural networks; natural language processing;
DOI
10.1109/TKDE.2021.3117608
Chinese Library Classification (CLC)
TP18 [Artificial Intelligence Theory];
Discipline Classification Codes
081104; 0812; 0835; 1405
Abstract
Deep neural networks (DNNs) have achieved remarkable success in various tasks, such as image classification, speech recognition, and natural language processing (NLP). However, researchers have demonstrated that DNN-based models are vulnerable to adversarial examples, which cause erroneous predictions by adding imperceptible perturbations to legitimate inputs. Recent studies have revealed adversarial examples in the text domain as well; these can effectively evade various DNN-based text analyzers and thereby fuel the proliferation of disinformation. In this paper, we present a comprehensive survey of existing adversarial techniques for generating adversarial texts in both English and Chinese, along with the corresponding defense methods. More importantly, we hope this work inspires future studies that develop DNN-based text analyzers robust against both known and unknown adversarial techniques. We classify the existing techniques for crafting adversarial texts by their perturbation units (e.g., characters, words, or sentences), which helps to clarify how adversarial texts are generated and how robust models can be built for defense. In presenting this taxonomy of adversarial attacks and defenses in the text domain, we introduce the adversarial techniques from the perspective of different NLP tasks. Finally, we discuss the open challenges of adversarial attacks and defenses on texts and outline future research directions in this emerging and challenging field.
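To make the perturbation-unit taxonomy concrete, below is a minimal, purely illustrative Python sketch of what character-, word-, and sentence-level perturbations might look like. The function names, the synonym table, and the specific edits are assumptions made for illustration; they are not techniques taken from the survey.

```python
# Illustrative only: toy perturbations at the three unit levels
# (character, word, sentence) used to classify adversarial text attacks.
# The function names, synonym table, and specific edits are hypothetical
# examples, not methods from the paper.

def char_level(text: str) -> str:
    """Character-level: swap two adjacent characters in the first word,
    a typo-style edit that a human reader easily glosses over."""
    words = text.split()
    if not words:
        return text
    w = words[0]
    if len(w) > 2:
        words[0] = w[0] + w[2] + w[1] + w[3:]
    return " ".join(words)

def word_level(text: str, synonyms: dict[str, str]) -> str:
    """Word-level: substitute words with (assumed) label-preserving synonyms."""
    return " ".join(synonyms.get(w, w) for w in text.split())

def sentence_level(text: str, distractor: str) -> str:
    """Sentence-level: append a semantically neutral distractor sentence."""
    return text + " " + distractor

if __name__ == "__main__":
    original = "the movie was wonderful and moving"
    print(char_level(original))                                    # "teh movie was ..."
    print(word_level(original, {"wonderful": "marvelous"}))        # word substitution
    print(sentence_level(original, "I watched it on a Tuesday."))  # appended sentence
```

Real attacks typically search over many such candidate edits, guided by the victim model's gradients or query feedback, rather than applying a single fixed edit as above.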
Pages: 3159-3179
Number of pages: 21
相关论文
共 233 条
  • [31] Boyd-Graber J, 2018, SPRING SER CHALLENGE, P169, DOI 10.1007/978-3-319-94042-7_9
  • [32] C. A. team, 2018, TOXIC COMMENT CLASSI
  • [33] Carlini N., 2019, ARXIV190206705
  • [34] Carlini N., 2019, COMPLETE LIST ALL AR
  • [35] Evading Deepfake-Image Detectors with White- and Black-Box Attacks
    Carlini, Nicholas
    Farid, Hany
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 2804 - 2813
  • [36] Towards Evaluating the Robustness of Neural Networks
    Carlini, Nicholas
    Wagner, David
    [J]. 2017 IEEE SYMPOSIUM ON SECURITY AND PRIVACY (SP), 2017, : 39 - 57
  • [37] Audio Adversarial Examples: Targeted Attacks on Speech-to-Text
    Carlini, Nicholas
    Wagner, David
    [J]. 2018 IEEE SYMPOSIUM ON SECURITY AND PRIVACY WORKSHOPS (SPW 2018), 2018, : 1 - 7
  • [38] Reading Wikipedia to Answer Open-Domain Questions
    Chen, Danqi
    Fisch, Adam
    Weston, Jason
    Bordes, Antoine
    [J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 1870 - 1879
  • [39] Chen L., 2020, P 58 ANN M ASS COMPU, P8801
  • [40] Enhanced LSTM for Natural Language Inference
    Chen, Qian
    Zhu, Xiaodan
    Ling, Zhenhua
    Wei, Si
    Jiang, Hui
    Inkpen, Diana
    [J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 1657 - 1668