Text Adversarial Defense via Granular-Ball Sample Enhancement

被引:0
|
作者
Wang, Zeli [1 ]
Li, Jian [1 ]
Xia, Shuyin [1 ]
Lin, Longlong [2 ]
Wang, Guoyin [1 ]
机构
[1] Chongqing Univ Posts & Telecommun, Minist Educ, Key Lab Cyberspace Big Data Intelligent Secur, Chongqing, Peoples R China
[2] Southwest Univ, Coll Comp & Informat Sci, Chongqing, Peoples R China
基金
中国国家自然科学基金;
关键词
Natural processing language; Adversarial defense; Clustering; Adversarial training; Sample enhancement;
D O I
10.1145/3652583.3658083
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning has achieved outstanding performance in natural language processing, but actuality has witnessed its fragility against adversarial attacks. Synonyms-based attacks are most disastrous since their generated samples approximate raw inputs. Several countermeasures have been proposed in the literature, but the defense effectiveness is unsatisfactory because of the clumsy single-granularity synonyms clustering. To mitigate this dilemma, we propose a Granular-Ball Sample Enhancement-based defense Framework (GBSEF) for text adversarial attacks. Specifically, GBSEF first adopts an effective general synonyms clustering algorithm, which can adaptively adjust the granularity of synonym sets (i.e., granular-balls) for diverse datasets. Regarding each ball as a dot, the function consisting of most dots well fits the original data distribution, resulting in the relationships among words being well presented by the granular-balls. GBSEF then replaces each input word with the center vector of its subordinate ball, to construct robust samples preserving syntax and semantic information simultaneously. Finally, GBSEF combines a random substitution mechanism with granular-balls. This way can prompt GBSEF to take full advantage of the multi-granularity feature of granular-balls, to get more diverse valid samples. GBSEF obtains great performance through training on these samples. Abundant evaluations demonstrate the robustness and effectiveness of GBSEF against adversarial attacks, albeit with a slight performance decrease under normal scenarios without attacks. Meanwhile, GBSEF has good transferability against adversarial samples. Compared with state-of-art defense countermeasures, under multiple attacks on four neural network models (i.e., CNN, LSTM, Bi-LSTM, BERT), GBSEF always outperforms existing baselines.
引用
收藏
页码:348 / 356
页数:9
相关论文
共 50 条
  • [21] GB-DBSCAN: A fast granular-ball based DBSCAN clustering algorithm
    Cheng, Dongdong
    Zhang, Cheng
    Li, Ya
    Xia, Shuyin
    Wang, Guoyin
    Huang, Jinlong
    Zhang, Sulan
    Xie, Jiang
    INFORMATION SCIENCES, 2024, 674
  • [22] GBRAIN: Combating Textual Label Noise by Granular-ball based Robust Training
    Wang, Zeli
    Zhang, Tuo
    Xia, Shuyin
    Lin, Longlong
    Wang, Guoyin
    PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 357 - 365
  • [23] Fast fixed granular-ball for attribute reduction in label noise environments and its application in medical diagnosis
    Peng, Xiaoli
    Wang, Ping
    Shao, Yabin
    Gong, Yuanlin
    Qian, Jie
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (03) : 1039 - 1054
  • [24] Clinical Medical Test Decision-Making of Liver Disease Using Granular-Ball Rough Set
    Xu, Fanxin
    Su, Zuqiang
    Wang, Guoyin
    ROUGH SETS, PT II, IJCRS 2024, 2024, 14840 : 265 - 279
  • [25] Granular-ball computing-based manifold clustering algorithms for ultra-scalable data
    Cheng, Dongdong
    Liu, Shushu
    Xia, Shuyin
    Wang, Guoyin
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 247
  • [26] GBRS: A Unified Granular-Ball Learning Model of Pawlak Rough Set and Neighborhood Rough Set
    Xia, Shuyin
    Wang, Cheng
    Wang, Guoyin
    Gao, Xinbo
    Ding, Weiping
    Yu, Jianhang
    Zhai, Yujia
    Chen, Zizhong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 1719 - 1733
  • [27] Multi-label feature selection for missing labels by granular-ball based mutual information
    Shu, Wenhao
    Hu, Yichen
    Qian, Wenbin
    APPLIED INTELLIGENCE, 2024, 54 (23) : 12589 - 12612
  • [28] A novel covering rough set model based on granular-ball computing for data with label noise
    Peng, Xiaoli
    Gong, Yuanlin
    Hou, Xiang
    Tang, Zhan
    Shao, Yabin
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2025, 182
  • [29] Granular-ball computing: an efficient, robust, and interpretable adaptive multi-granularity representation and computation method
    Xia, Shuyin
    Wang, Guoyin
    Gao, Xinbo
    Lian, Xiaoyu
    arXiv, 2023,
  • [30] Fast fixed granular-ball for attribute reduction in label noise environments and its application in medical diagnosis
    Xiaoli Peng
    Ping Wang
    Yabin Shao
    Yuanlin Gong
    Jie Qian
    International Journal of Machine Learning and Cybernetics, 2024, 15 : 1039 - 1054