Interpretable Adversarial Perturbation in Input Embedding Space for Text

被引：0

作者：

Sato, Motoki ^{[1
,3
,5
]}

Suzuki, Jun ^{[2
,4
,6
]}

Shindo, Hiroyuki ^{[3
,4
]}

Matsumoto, Yuji ^{[3
,4
]}

机构：

[1] Preferred Networks Inc, Tokyo, Japan

[2] NTT Commun Sci Labs, Kyoto, Japan

[3] Nara Inst Sci & Technol, Ikoma, Nara, Japan

[4] RIKEN Ctr Adv Intelligence Project, Tokyo, Japan

[5] RIKEN AIP, Tokyo, Japan

[6] Tohoku Univ, Sendai, Miyagi, Japan

来源：

PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2018年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Following great success in the image processing field, the idea of adversarial training has been applied to tasks in the natural language processing (NLP) field. One promising approach directly applies adversarial training developed in the image processing field to the input word embedding space instead of the discrete input space of texts. However, this approach abandons such interpretability as generating adversarial texts to significantly improve the performance of NLP tasks. This paper restores interpretability to such methods by restricting the directions of perturbations toward the existing words in the input embedding space. As a result, we can straightforwardly reconstruct each input with perturbations to an actual text by considering the perturbations to be the replacement of words in the sentence while maintaining or even improving the task performance(1).

引用

页码：4323 / 4330

页数：8

共 50 条

[41] Adversarial Text Normalization
Bitton, Joanna
Pavlova, Maya
Evtimov, Ivan
2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, NAACL-HLT 2022, 2022, : 268 - 279
[42] Revision in Continuous Space: Unsupervised Text Style Transfer without Adversarial Learning
Liu, Dayiheng
Fu, Jie
Zhang, Yidan
Pal, Chris
Lv, Jiancheng
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 8376 - 8383
[43] Boosting the Transferability of Adversarial Attacks with Reverse Adversarial Perturbation
Qin, Zeyu
Fan, Yanbo
Liu, Yi
Shen, Li
Zhang, Yong
Wang, Jue
Wu, Baoyuan
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[44] Point Cloud Adversarial Perturbation Generation for Adversarial Attacks
He, Fengmei
Chen, Yihuai
Chen, Ruidong
Nie, Weizhi
IEEE ACCESS, 2023, 11 : 2767 - 2774
[45] Adversarial Multi-Grained Embedding Network for Cross-Modal Text-Video Retrieval
Han, Ning
Chen, Jingjing
Zhang, Hao
Wang, Huanwen
Chen, Hao
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 18 (02)
[46] Instance Mask Embedding and Attribute-Adaptive Generative Adversarial Network for Text-to-Image Synthesis
Ni, Jiancheng
Zhang, Susu
Zhou, Zili
Hou, Jie
Gao, Feng
IEEE ACCESS, 2020, 8 (08): : 37697 - 37711
[47] Low Frequency Adversarial Perturbation
Guo, Chuan
Frank, Dared S.
Weinberger, Kilian Q.
35TH UNCERTAINTY IN ARTIFICIAL INTELLIGENCE CONFERENCE (UAI 2019), 2020, 115 : 1127 - 1137
[48] Adaptive Perturbation for Adversarial Attack
Yuan, Zheng
Zhang, Jie
Jiang, Zhaoyan
Li, Liangliang
Shan, Shiguang
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (08) : 5663 - 5676
[49] OoMMix: Out-of-manifold Regularization in Contextual Embedding Space for Text Classification
Lee, Seonghyeon
Lee, Dongha
Yu, Hwanjo
59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021, : 590 - 599
[50] Perturbation Type Categorization for Multiple Adversarial Perturbation Robustness
Maini, Pratyush
Chen, Xinyun
Li, Bo
Song, Dawn
UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 180, 2022, 180 : 1317 - 1327

← 1 2 3 4 5 →