Text Adversarial Defense via Granular-Ball Sample Enhancement

被引:0
|
作者
Wang, Zeli [1 ]
Li, Jian [1 ]
Xia, Shuyin [1 ]
Lin, Longlong [2 ]
Wang, Guoyin [1 ]
机构
[1] Chongqing Univ Posts & Telecommun, Minist Educ, Key Lab Cyberspace Big Data Intelligent Secur, Chongqing, Peoples R China
[2] Southwest Univ, Coll Comp & Informat Sci, Chongqing, Peoples R China
基金
中国国家自然科学基金;
关键词
Natural processing language; Adversarial defense; Clustering; Adversarial training; Sample enhancement;
D O I
10.1145/3652583.3658083
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning has achieved outstanding performance in natural language processing, but actuality has witnessed its fragility against adversarial attacks. Synonyms-based attacks are most disastrous since their generated samples approximate raw inputs. Several countermeasures have been proposed in the literature, but the defense effectiveness is unsatisfactory because of the clumsy single-granularity synonyms clustering. To mitigate this dilemma, we propose a Granular-Ball Sample Enhancement-based defense Framework (GBSEF) for text adversarial attacks. Specifically, GBSEF first adopts an effective general synonyms clustering algorithm, which can adaptively adjust the granularity of synonym sets (i.e., granular-balls) for diverse datasets. Regarding each ball as a dot, the function consisting of most dots well fits the original data distribution, resulting in the relationships among words being well presented by the granular-balls. GBSEF then replaces each input word with the center vector of its subordinate ball, to construct robust samples preserving syntax and semantic information simultaneously. Finally, GBSEF combines a random substitution mechanism with granular-balls. This way can prompt GBSEF to take full advantage of the multi-granularity feature of granular-balls, to get more diverse valid samples. GBSEF obtains great performance through training on these samples. Abundant evaluations demonstrate the robustness and effectiveness of GBSEF against adversarial attacks, albeit with a slight performance decrease under normal scenarios without attacks. Meanwhile, GBSEF has good transferability against adversarial samples. Compared with state-of-art defense countermeasures, under multiple attacks on four neural network models (i.e., CNN, LSTM, Bi-LSTM, BERT), GBSEF always outperforms existing baselines.
引用
收藏
页码:348 / 356
页数:9
相关论文
共 50 条
  • [41] Customizable text generation via conditional text generative adversarial network
    Chen, Jinyin
    Wu, Yangyang
    Jia, Chengyu
    Zheng, Haibin
    Huang, Guohan
    NEUROCOMPUTING, 2020, 416 (416) : 125 - 135
  • [42] Lightweight Privacy Protection via Adversarial Sample
    Xie, Guangxu
    Hou, Gaopan
    Pei, Qingqi
    Huang, Haibo
    ELECTRONICS, 2024, 13 (07)
  • [43] 3WC-GBNRS++: A Novel Three-Way Classifier With Granular-Ball Neighborhood Rough Sets Based on Uncertainty
    Yang, Jie
    Liu, Zhuangzhuang
    Xia, Shuyin
    Wang, Guoyin
    Zhang, Qinghua
    Li, Shuai
    Xu, Taihua
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2024, 32 (08) : 4376 - 4387
  • [44] AuxBlocks: Defense Adversarial Examples via Auxiliary Blocks
    Yu, Yueyao
    Yu, Pengfei
    Li, Wenye
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [45] Adversarial Defense via Learning to Generate Diverse Attacks
    Jang, Yunseok
    Zhao, Tianchen
    Hong, Seunghoon
    Lee, Honglak
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 2740 - 2749
  • [46] Generative adversarial defense via conditional diffusion model
    Shi, Xiaowen
    Zhou, Chao
    Wang, Yuan-Gen
    MULTIMEDIA SYSTEMS, 2025, 31 (01)
  • [47] An Adversarial sample defense method based on multi-scale GAN
    Shao, Mingwen
    Liu, Shuqi
    Wang, Ran
    Zhang, Gaozhi
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (12) : 3437 - 3447
  • [48] An Adversarial sample defense method based on multi-scale GAN
    Mingwen Shao
    Shuqi Liu
    Ran Wang
    Gaozhi Zhang
    International Journal of Machine Learning and Cybernetics, 2021, 12 : 3437 - 3447
  • [49] Text-to-Sketch Synthesis via Adversarial Network
    Martis, Jason Elroy
    Shetty, Sannidhan Manjaya
    Pradhan, Manas Ranjan
    Desai, Usha
    Acharya, Biswaranjan
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 76 (01): : 915 - 938
  • [50] Adversarial Text Generation via Sequence Contrast Discrimination
    Wang, Ke
    Wan, Xiaojun
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 47 - 53