Generating Fluent Chinese Adversarial Examples for Sentiment Classification

被引:0
作者
Wang, Congyi [1 ,2 ]
Zeng, Jianping [1 ,2 ]
Wu, Chengrong [1 ,2 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai 200433, Peoples R China
[2] Minist Educ, Engn Res Ctr Cyber Secur Auditing & Monitoring, Shanghai 200433, Peoples R China
来源
2020 IEEE 14TH INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY, AND IDENTIFICATION (ASID) | 2020年
基金
国家重点研发计划;
关键词
Adversarial examples; Chinese natural language; Sentiment classification;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Highly accurate classifiers can be trained by existing machine learning models, however, most of these classifiers do not consider the adversarial attack. This makes these classifiers vulnerable to adversarial examples. In order to improve the ability of sentiment classifiers to resist the adversarial attack, it is very important to generate high-quality adversarial examples. Most of the existing methods that generate natural language adversarial examples aim at English text with relatively simple strategies, but a single transformation strategy is easily detected by the defender. In this paper, we propose a new method to generate Chinese natural language adversarial examples, which is called AD-ER (Adversarial Examples with Readability). The first step is to select the important words in the text, which have great impact on the sentiment classifier. Then we proposed four variant strategies to replace the important words and the best candidate word is selected heuristically under the constraints of its readability and maximum entropy model. The simulation results on a real shopping review dataset verify that the examples generated by our method can produce large attack disturbance to the classifiers. Different from other examples, our examples have good readability and diversity, which are more fluent and harder to be detected.
引用
收藏
页码:149 / +
页数:6
相关论文
共 50 条
[21]   Generating Valid and Natural Adversarial Examples with Large Language Models [J].
Wang, Zimu ;
Wang, Wei ;
Chen, Qi ;
Wang, Qiufeng ;
Anh Nguyen .
PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, :1716-1721
[22]   Generating traceable adversarial text examples by watermarking in the semantic space [J].
Li, Mingjie ;
Wu, Hanzhou ;
Zhang, Xinpeng .
JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (06)
[23]   A novel approach to generating high-resolution adversarial examples [J].
Xianjin Fang ;
Zhiwei Li ;
Gaoming Yang .
Applied Intelligence, 2022, 52 :1289-1305
[24]   IAE: Irony-Based Adversarial Examples for Sentiment Analysis Systems [J].
Yi, Xiaoyin ;
Huang, Jiacheng .
IEEE ACCESS, 2024, 12 :105605-105612
[25]   Countermeasures Against Adversarial Examples in Radio Signal Classification [J].
Zhang, Lu ;
Lambotharan, Sangarapillai ;
Zheng, Gan ;
AsSadhan, Basil ;
Roli, Fabio .
IEEE WIRELESS COMMUNICATIONS LETTERS, 2021, 10 (08) :1830-1834
[26]   Generating Adversarial Examples of Source Code Classification Models via Q-Learning-Based Markov Decision Process [J].
Tian, Junfeng ;
Wang, Chenxin ;
Li, Zhen ;
Wen, Yu .
2021 IEEE 21ST INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY (QRS 2021), 2021, :807-818
[27]   Generating Semantic Adversarial Examples via Feature Manipulation in Latent Space [J].
Wang, Shuo ;
Chen, Shangyu ;
Chen, Tianle ;
Nepal, Surya ;
Rudolph, Carsten ;
Grobler, Marthie .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (12) :17070-17084
[28]   Cooperative Co-evolutionary Genetic Algorithm for Generating Adversarial Examples [J].
Liang, Zilong ;
Huang, Minrui ;
Chan, Miuyi ;
Zhang, Xinyuan .
2024 INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTICS AND AUTOMATIC CONTROL, IRAC, 2024, :402-405
[29]   TrojanForge: Generating Adversarial Hardware Trojan Examples Using Reinforcement Learning [J].
Sarihi, Amin ;
Jamieson, Peter ;
Patooghy, Ahmad ;
Badawy, Abdel-Hameed A. .
PROCEEDINGS OF THE 2024 ACM/IEEE INTERNATIONAL SYMPOSIUM ON MACHINE LEARNING FOR CAD, MLCAD 2024, 2024,
[30]   Generating Robust Adversarial Examples against Online Social Networks (OSNs) [J].
Liu, Jun ;
Zhou, Jiantao ;
Wu, Haiwei ;
Sun, Weiwei ;
Tian, Jinyu .
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (04)