Improving Robustness of Language Models from a Geometry-aware Perspective

被引:0
|
作者
Zhu, Bin [1 ]
Gu, Zhaoquan [1 ,2 ]
Wang, Le [1 ,2 ]
Chen, Jinyin [3 ]
Xuan, Qi [3 ]
机构
[1] Guangzhou Univ, Cyberspace Inst Adv Technol CIAT, Guangzhou 510006, Peoples R China
[2] Peng Cheng Lab, Inst Cyberspace Platform, Shenzhen 999077, Peoples R China
[3] Zhejiang Univ Technol, Inst Cyberspace Secur, Hangzhou 310023, Peoples R China
来源
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022) | 2022年
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent studies have found that removing the norm-bounded projection and increasing search steps in adversarial training can significantly improve robustness. However, we observe that a too large number of search steps can hurt accuracy. We aim to obtain strong robustness efficiently using fewer steps. Through a toy experiment, we find that perturbing the clean data to the decision boundary but not crossing it does not degrade the test accuracy. Inspired by this, we propose friendly adversarial data augmentation (FADA) to generate friendly adversarial data. On top of FADA, we propose geometry-aware adversarial training (GAT) to perform adversarial training on friendly adversarial data so that we can save a large number of search steps. Comprehensive experiments across two widely used datasets and three pretrained language models demonstrate that GAT can obtain stronger robustness via fewer steps. In addition, we provide extensive empirical results and in-depth analyses on robustness to facilitate future studies.
引用
收藏
页码:3115 / 3125
页数:11
相关论文
共 50 条
  • [1] GPSFun: geometry-aware protein sequence function predictions with language models
    Yuan, Qianmu
    Tian, Chong
    Song, Yidong
    Ou, Peihua
    Zhu, Mingming
    Zhao, Huiying
    Yang, Yuedong
    NUCLEIC ACIDS RESEARCH, 2024, 52 (W1) : W248 - W255
  • [2] Geometry-Aware Adaptation for Pretrained Models
    Roberts, Nicholas
    Li, Xintong
    Adila, Dyah
    Cromp, Sonia
    Huang, Tzu-Heng
    Zhao, Jitian
    Sala, Frederic
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [3] Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator
    Shi, Zifan
    Xu, Yinghao
    Shen, Yujun
    Zhao, Deli
    Chen, Qifeng
    Yeung, Dit-Yan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [4] Geometry-Aware Neural Rendering
    Tobin, Josh
    Abbeel, Pieter
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [5] Geometry-aware Deep Transform
    Huang, Jiaji
    Qiu, Qiang
    Calderbank, Robert
    Sapiro, Guillermo
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 4139 - 4147
  • [6] Tri-Perspective View Decomposition for Geometry-Aware Depth Completion
    Yan, Zhiqiang
    Lin, Yuankai
    Wang, Kun
    Zheng, Yupeng
    Wang, Yufei
    Zhang, Zhenyu
    Li, Jun
    Yang, Jian
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 4874 - 4884
  • [7] Geometry-Aware Neighborhood Search for Learning Local Models for Image Superresolution
    Ferreira, Julio Cesar
    Vural, Elif
    Guillemot, Christine
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (03) : 1354 - 1367
  • [8] Geometry-aware Tracking of Manipulability Ellipsoids
    Jaquier, Noemie
    Rozo, Leonel
    Caldwell, Darwin G.
    Calinon, Sylvain
    ROBOTICS: SCIENCE AND SYSTEMS XIV, 2018,
  • [9] Geometry-Aware Face Completion and Editing
    Song, Linsen
    Cao, Jie
    Song, Lingxiao
    Hu, Yibo
    He, Ran
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 2506 - 2513
  • [10] Geometry-aware Dynamic Movement Primitives
    Abu-Dakka, Fares J.
    Kyrki, Ville
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 4421 - 4426