Robustness-Aware Word Embedding Improves Certified Robustness to Adversarial Word Substitutions

被引:0
|
作者
Wang, Yibin [1 ]
Yang, Yichen [1 ]
He, Di [2 ]
He, Kun [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan, Peoples R China
[2] Peking Univ, Sch Intelligence Sci & Technol, Beijing, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Natural Language Processing (NLP) models have gained great success on clean texts, but they are known to be vulnerable to adversarial examples typically crafted by synonym substitutions. In this paper, we target to solve this problem and find that word embedding is important to the certified robustness of NLP models. Given the findings, we propose the Embedding Interval Bound Constraint (EIBC) triplet loss to train robustness-aware word embeddings for better certified robustness. We optimize the EIBC triplet loss to reduce distances between synonyms in the embedding space, which is theoretically proven to make the verification boundary tighter. Meanwhile, we enlarge distances among non-synonyms, maintaining the semantic representation of word embeddings. Our method is conceptually simple and componentized. It can be easily combined with IBP training and improves the certified robust accuracy from 76.73% to 84.78% on the IMDB dataset. Experiments demonstrate that our method outperforms various state-of-the-art certified defense baselines and generalizes well to unseen substitutions. The code is available at https://github.com/JHL-HUST/EIBC-IBP/.
引用
收藏
页码:673 / 687
页数:15
相关论文
共 50 条
  • [21] Adversarial Robustness of Probabilistic Network Embedding for Link Prediction
    Chen, Xi
    Kang, Bo
    Lijffijt, Jefrey
    Bie, Tijl De
    MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES, PT II, 2021, 1525 : 22 - 38
  • [22] Improving Robustness-aware Design Space Exploration for FPGA-based Systems
    Tuzov, Ilya
    de Andres, David
    Ruiz, Juan-Carlos
    2020 16TH EUROPEAN DEPENDABLE COMPUTING CONFERENCE (EDCC 2020), 2020, : 1 - 8
  • [23] Wasserstein Smoothing: Certified Robustness against Wasserstein Adversarial Attacks
    Levine, Alexander
    Feizi, Soheil
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 3938 - 3946
  • [24] Chiron: A Robustness-Aware Incentive Scheme for Edge Learning via Hierarchical Reinforcement Learning
    Liu, Yi
    Guo, Song
    Zhan, Yufeng
    Wu, Leijie
    Hong, Zicong
    Zhou, Qihua
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (08) : 8508 - 8524
  • [25] Enhancing Adversarial Robustness via Anomaly-aware Adversarial Training
    Tang, Keke
    Lou, Tianrui
    He, Xu
    Shi, Yawen
    Zhu, Peican
    Gu, Zhaoquan
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT I, KSEM 2023, 2023, 14117 : 328 - 342
  • [26] Confidence-Aware Training of Smoothed Classifiers for Certified Robustness
    Jeong, Jongheon
    Kim, Seojin
    Shin, Jinwoo
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 7, 2023, : 8005 - 8013
  • [27] Robustness-Aware Sleep Transistor Engineering for Power-Gated Nanometer Subthreshold Circuits
    Bol, David
    Hocquet, Cedric
    Flandre, Denis
    Legat, Jean-Didier
    2010 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, 2010, : 1484 - 1487
  • [28] Robustness-Aware 3D Object Detection in Autonomous Driving: A Review and Outlook
    Song, Ziying
    Liu, Lin
    Jia, Feiyang
    Luo, Yadan
    Jia, Caiyan
    Zhang, Guoxin
    Yang, Lei
    Wang, Li
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 15407 - 15436
  • [29] Toward Certified Robustness of Graph Neural Networks in Adversarial AIoT Environments
    Lai, Yuni
    Zhou, Jialong
    Zhang, Xiaoge
    Zhou, Kai
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (15) : 13920 - 13932
  • [30] Edge enhancement improves adversarial robustness in image classification
    He, Lirong
    Ai, Qingzhong
    Lei, Yuqing
    Pan, Lili
    Ren, Yazhou
    Xu, Zenglin
    NEUROCOMPUTING, 2023, 518 : 122 - 132