Text Adversarial Defense via Granular-Ball Sample Enhancement

被引:0
|
作者
Wang, Zeli [1 ]
Li, Jian [1 ]
Xia, Shuyin [1 ]
Lin, Longlong [2 ]
Wang, Guoyin [1 ]
机构
[1] Chongqing Univ Posts & Telecommun, Minist Educ, Key Lab Cyberspace Big Data Intelligent Secur, Chongqing, Peoples R China
[2] Southwest Univ, Coll Comp & Informat Sci, Chongqing, Peoples R China
基金
中国国家自然科学基金;
关键词
Natural processing language; Adversarial defense; Clustering; Adversarial training; Sample enhancement;
D O I
10.1145/3652583.3658083
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning has achieved outstanding performance in natural language processing, but actuality has witnessed its fragility against adversarial attacks. Synonyms-based attacks are most disastrous since their generated samples approximate raw inputs. Several countermeasures have been proposed in the literature, but the defense effectiveness is unsatisfactory because of the clumsy single-granularity synonyms clustering. To mitigate this dilemma, we propose a Granular-Ball Sample Enhancement-based defense Framework (GBSEF) for text adversarial attacks. Specifically, GBSEF first adopts an effective general synonyms clustering algorithm, which can adaptively adjust the granularity of synonym sets (i.e., granular-balls) for diverse datasets. Regarding each ball as a dot, the function consisting of most dots well fits the original data distribution, resulting in the relationships among words being well presented by the granular-balls. GBSEF then replaces each input word with the center vector of its subordinate ball, to construct robust samples preserving syntax and semantic information simultaneously. Finally, GBSEF combines a random substitution mechanism with granular-balls. This way can prompt GBSEF to take full advantage of the multi-granularity feature of granular-balls, to get more diverse valid samples. GBSEF obtains great performance through training on these samples. Abundant evaluations demonstrate the robustness and effectiveness of GBSEF against adversarial attacks, albeit with a slight performance decrease under normal scenarios without attacks. Meanwhile, GBSEF has good transferability against adversarial samples. Compared with state-of-art defense countermeasures, under multiple attacks on four neural network models (i.e., CNN, LSTM, Bi-LSTM, BERT), GBSEF always outperforms existing baselines.
引用
收藏
页码:348 / 356
页数:9
相关论文
共 50 条
  • [31] Three-Way Approximations Fusion With Granular-Ball Computing to Guide Multigranularity Fuzzy Entropy for Feature Selection
    Xia, Deyou
    Wang, Guoyin
    Zhang, Qinghua
    Yang, Jie
    Xia, Shuyin
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2024, 32 (10) : 5963 - 5977
  • [32] Adversarial Text Purification: A Large Language Model Approach for Defense
    Moraffah, Raha
    Khandelwal, Shubh
    Bhattacharjee, Amrita
    Liu, Huan
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT V, PAKDD 2024, 2024, 14649 : 65 - 77
  • [33] Text Adversarial Examples Generation and Defense Based on Reinforcement Learning
    Li, Yue
    Xu, Pengjian
    Ruan, Qing
    Xu, Wusheng
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2021, 28 (04): : 1306 - 1314
  • [34] ADVERSARIAL DEFENSE VIA LOCAL FLATNESS REGULARIZATION
    Xu, Jia
    Li, Yiming
    Jiang, Yong
    Xia, Shu-Tao
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2196 - 2200
  • [35] Granular ball-based fuzzy multineighborhood rough set for feature selection via label enhancement
    Sun, Lin
    Du, Wenjuan
    Ding, Weiping
    Long, Qian
    Xu, Jiucheng
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 145
  • [36] Adversarial Sample Attack and Defense Method for Encrypted Traffic Data
    Ding, Yi
    Zhu, Guiqin
    Chen, Dajiang
    Qin, Xue
    Cao, Mingsheng
    Qin, Zhiguang
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (10) : 18024 - 18039
  • [37] DNN Robustness Enhancement Based on Adversarial Sample Prioritization
    Zhang, Long
    Wu, Jiangzhao
    Ma, Siyuan
    Liu, Jian
    IEEE ACCESS, 2024, 12 : 147860 - 147881
  • [38] Rule-based adversarial sample generation for text classification
    Nai Zhou
    Nianmin Yao
    Jian Zhao
    Yanan Zhang
    Neural Computing and Applications, 2022, 34 : 10575 - 10586
  • [39] Rule-based adversarial sample generation for text classification
    Zhou, Nai
    Yao, Nianmin
    Zhao, Jian
    Zhang, Yanan
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (13): : 10575 - 10586
  • [40] Prediction of hydrological and water quality data based on granular-ball rough set and k-nearest neighbor analysis
    Dong, Limei
    Zuo, Xinyu
    Xiong, Yiping
    PLOS ONE, 2024, 19 (02):