A Textual Backdoor Defense Method Based on Deep Feature Classification

Cited by: 2
Authors
Shao, Kun [1]
Yang, Junan [1]
Hu, Pengjiang [1]
Li, Xiaoshuai [1]
Affiliations
[1] Natl Univ Def Technol, Coll Elect Engn, Hefei 230037, Peoples R China
Keywords
deep neural networks; natural language processing; adversarial machine learning; backdoor attacks; backdoor defenses; ATTACKS
DOI
10.3390/e25020220
Chinese Library Classification (CLC) number
O4 [Physics]
Discipline classification code
0702
Abstract
Natural language processing (NLP) models based on deep neural networks (DNNs) are vulnerable to backdoor attacks. Existing backdoor defense methods have limited effectiveness and cover a limited range of scenarios. We propose a textual backdoor defense method based on deep feature classification, which consists of deep feature extraction and classifier construction. The method exploits the distinguishability of the deep features of poisoned and benign data, and implements backdoor defense in both offline and online scenarios. We conducted defense experiments on two datasets and two models against a variety of backdoor attacks. The experimental results demonstrate the effectiveness of this defense approach and show that it outperforms the baseline defense method.
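The abstract does not say which feature extractor or classifier the authors use, so the following is only a minimal sketch of the general idea of classifying deep features as poisoned or benign. It assumes the deep features have already been extracted as fixed-length vectors, substitutes synthetic Gaussian vectors for them, and uses a nearest-centroid rule as a stand-in classifier. All names here (`CentroidDetector`, `synthetic_features`, the dimension and the means) are hypothetical and do not come from the paper.

```python
import random


def centroid(vectors):
    """Component-wise mean of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


class CentroidDetector:
    """Hypothetical stand-in classifier: flags a feature vector as poisoned
    when it lies closer to the poisoned centroid than to the benign one."""

    def fit(self, benign, poisoned):
        self.benign_c = centroid(benign)
        self.poison_c = centroid(poisoned)
        return self

    def is_poisoned(self, feature):
        return sq_dist(feature, self.poison_c) < sq_dist(feature, self.benign_c)


def synthetic_features(n, mean, dim=8, seed=0):
    """Synthetic substitute for extracted deep features (an assumption:
    real features would come from a hidden layer of the NLP model)."""
    rng = random.Random(seed)
    return [[rng.gauss(mean, 1.0) for _ in range(dim)] for _ in range(n)]


# Assumed setup: benign and poisoned features form separable clusters.
benign = synthetic_features(200, mean=0.0, seed=1)
poisoned = synthetic_features(200, mean=3.0, seed=2)

# Fit on the first 150 samples of each class, evaluate on the rest.
detector = CentroidDetector().fit(benign[:150], poisoned[:150])
held_out = [(f, False) for f in benign[150:]] + [(f, True) for f in poisoned[150:]]
accuracy = sum(detector.is_poisoned(f) == label for f, label in held_out) / len(held_out)
print(f"held-out detection accuracy: {accuracy:.2f}")
```

In an offline setting, a detector of this kind could be used to filter a training set before retraining; in an online setting, it could screen inputs at inference time. How the paper actually realizes either scenario is not stated in the abstract.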
Pages: 13