Backdoor Attacks Against Transfer Learning With Pre-Trained Deep Learning Models

被引:51
作者
Wang, Shuo [1 ]
Nepal, Surya [2 ]
Rudolph, Carsten [3 ]
Grobler, Marthie [2 ]
Chen, Shangyu [4 ]
Chen, Tianle [3 ]
机构
[1] Monash Univ, Fac Informat Technol, Clayton, Vic 3800, Australia
[2] CSIROs Data61, Melbourne, Vic 3008, Australia
[3] Monash Univ, Fac Informat Technol, Melbourne, Vic 3800, Australia
[4] Univ Melbourne, Melbourne, Vic 3010, Australia
关键词
Web service; deep neural network; backdoor attack; transfer learning; pre-trained model;
D O I
10.1109/TSC.2020.3000900
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Transfer learning provides an effective solution for feasibly and fast customize accurate Student models, by transferring the learned knowledge of pre-trained Teacher models over large datasets via fine-tuning. Many pre-trained Teacher models used in transfer learning are publicly available and maintained by public platforms, increasing their vulnerability to backdoor attacks. In this article, we demonstrate a backdoor threat to transfer learning tasks on both image and time-series data leveraging the knowledge of publicly accessible Teacher models, aimed at defeating three commonly adopted defenses: pruning-based, retraining-based and input pre-processing-based defenses. Specifically, (A) ranking-based selection mechanism to speed up the backdoor trigger generation and perturbation process while defeating pruning-based and/or retraining-based defenses. (B) autoencoder-powered trigger generation is proposed to produce a robust trigger that can defeat the input pre-processing-based defense, while guaranteeing that selected neuron (s) can be significantly activated. (C) defense-aware retraining to generate the manipulated model using reverse-engineered model inputs. We launch effective misclassification attacks on Student models over real-world images, brain Magnetic Resonance Imaging (MRI) data and Electrocardiography (ECG) learning systems. The experiments reveal that our enhanced attack can maintain the 98.4 and 97.2 percent classification accuracy as the genuine model on clean image and time series inputs while improving 27.9% - 100% and 27.1% - 56.1% attack success rate on trojaned image and time series inputs respectively in the presence of pruning-based and/or retraining-based defenses.
引用
收藏
页码:1526 / 1539
页数:14
相关论文
共 50 条
[31]   Transfer Learning in Pre-Trained Large Language Models for Malware Detection Based on System Calls [J].
Sanchez Sanchez, Pedro Miguel ;
Huertas Celdran, Alberto ;
Bovet, Gerome ;
Martinez Perez, Gregorio .
MILCOM 2024-2024 IEEE MILITARY COMMUNICATIONS CONFERENCE, MILCOM, 2024, :853-858
[32]   Backdoor attacks against distributed swarm learning [J].
Chen, Kongyang ;
Zhang, Huaiyuan ;
Feng, Xiangyu ;
Zhang, Xiaoting ;
Mi, Bing ;
Jin, Zhiping .
ISA TRANSACTIONS, 2023, 141 :59-72
[33]   Identification of Corneal Ulcers with Pre-Trained AlexNet Based on Transfer Learning [J].
Cinar, Ilkay ;
Taspinar, Y. Selim ;
Kursun, Ramazan ;
Koklu, Murat .
2022 11TH MEDITERRANEAN CONFERENCE ON EMBEDDED COMPUTING (MECO), 2022, :631-634
[34]   Facial age estimation using pre-trained CNN and transfer learning [J].
Issam Dagher ;
Dany Barbara .
Multimedia Tools and Applications, 2021, 80 :20369-20380
[35]   Transfer learning of pre-trained CNNs on digital transaction fraud detection [J].
Tekkali, Chandana Gouri ;
Natarajan, Karthika .
INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2024, 28 (03) :571-580
[36]   Comparison of Pre-Trained CNNs for Audio Classification Using Transfer Learning [J].
Tsalera, Eleni ;
Papadakis, Andreas ;
Samarakou, Maria .
JOURNAL OF SENSOR AND ACTUATOR NETWORKS, 2021, 10 (04)
[37]   Facial age estimation using pre-trained CNN and transfer learning [J].
Dagher, Issam ;
Barbara, Dany .
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (13) :20369-20380
[38]   A hybrid model in transfer learning based pre-trained model and a scale factor updating [J].
Jiao, Peng ;
Pei, Jing .
2018 JOINT 7TH INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION (ICIEV) AND 2018 2ND INTERNATIONAL CONFERENCE ON IMAGING, VISION & PATTERN RECOGNITION (ICIVPR), 2018, :538-543
[39]   Adversarially Guided Stateful Defense Against Backdoor Attacks in Federated Deep Learning [J].
Ali, Hassan ;
Nepal, Surya ;
Kanhere, Salil S. ;
Jha, Sanjay .
2024 ANNUAL COMPUTER SECURITY APPLICATIONS CONFERENCE, ACSAC, 2024, :794-809
[40]   Backdoor Attacks Against Deep Learning-based Massive MIMO Localization [J].
Zhao, Tianya ;
Wang, Xuyu ;
Mao, Shiwen .
IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, :2796-2801