Interpretable Adversarial Perturbation in Input Embedding Space for Text

被引:0
|
作者
Sato, Motoki [1 ,3 ,5 ]
Suzuki, Jun [2 ,4 ,6 ]
Shindo, Hiroyuki [3 ,4 ]
Matsumoto, Yuji [3 ,4 ]
机构
[1] Preferred Networks Inc, Tokyo, Japan
[2] NTT Commun Sci Labs, Kyoto, Japan
[3] Nara Inst Sci & Technol, Ikoma, Nara, Japan
[4] RIKEN Ctr Adv Intelligence Project, Tokyo, Japan
[5] RIKEN AIP, Tokyo, Japan
[6] Tohoku Univ, Sendai, Miyagi, Japan
来源
PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2018年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Following great success in the image processing field, the idea of adversarial training has been applied to tasks in the natural language processing (NLP) field. One promising approach directly applies adversarial training developed in the image processing field to the input word embedding space instead of the discrete input space of texts. However, this approach abandons such interpretability as generating adversarial texts to significantly improve the performance of NLP tasks. This paper restores interpretability to such methods by restricting the directions of perturbations toward the existing words in the input embedding space. As a result, we can straightforwardly reconstruct each input with perturbations to an actual text by considering the perturbations to be the replacement of words in the sentence while maintaining or even improving the task performance(1).
引用
收藏
页码:4323 / 4330
页数:8
相关论文
共 50 条
  • [41] Adversarial Text Normalization
    Bitton, Joanna
    Pavlova, Maya
    Evtimov, Ivan
    2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, NAACL-HLT 2022, 2022, : 268 - 279
  • [42] Revision in Continuous Space: Unsupervised Text Style Transfer without Adversarial Learning
    Liu, Dayiheng
    Fu, Jie
    Zhang, Yidan
    Pal, Chris
    Lv, Jiancheng
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 8376 - 8383
  • [43] Boosting the Transferability of Adversarial Attacks with Reverse Adversarial Perturbation
    Qin, Zeyu
    Fan, Yanbo
    Liu, Yi
    Shen, Li
    Zhang, Yong
    Wang, Jue
    Wu, Baoyuan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [44] Point Cloud Adversarial Perturbation Generation for Adversarial Attacks
    He, Fengmei
    Chen, Yihuai
    Chen, Ruidong
    Nie, Weizhi
    IEEE ACCESS, 2023, 11 : 2767 - 2774
  • [45] Adversarial Multi-Grained Embedding Network for Cross-Modal Text-Video Retrieval
    Han, Ning
    Chen, Jingjing
    Zhang, Hao
    Wang, Huanwen
    Chen, Hao
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 18 (02)
  • [46] Instance Mask Embedding and Attribute-Adaptive Generative Adversarial Network for Text-to-Image Synthesis
    Ni, Jiancheng
    Zhang, Susu
    Zhou, Zili
    Hou, Jie
    Gao, Feng
    IEEE ACCESS, 2020, 8 (08): : 37697 - 37711
  • [47] Low Frequency Adversarial Perturbation
    Guo, Chuan
    Frank, Dared S.
    Weinberger, Kilian Q.
    35TH UNCERTAINTY IN ARTIFICIAL INTELLIGENCE CONFERENCE (UAI 2019), 2020, 115 : 1127 - 1137
  • [48] Adaptive Perturbation for Adversarial Attack
    Yuan, Zheng
    Zhang, Jie
    Jiang, Zhaoyan
    Li, Liangliang
    Shan, Shiguang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (08) : 5663 - 5676
  • [49] OoMMix: Out-of-manifold Regularization in Contextual Embedding Space for Text Classification
    Lee, Seonghyeon
    Lee, Dongha
    Yu, Hwanjo
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021, : 590 - 599
  • [50] Perturbation Type Categorization for Multiple Adversarial Perturbation Robustness
    Maini, Pratyush
    Chen, Xinyun
    Li, Bo
    Song, Dawn
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 180, 2022, 180 : 1317 - 1327