LKD-STNN: A Lightweight Malicious Traffic Detection Method for Internet of Things Based on Knowledge Distillation

被引：2

作者：

Zhu, Shizhou ^{[1
]}

Xu, Xiaolong ^{[1
,2
]}

Zhao, Juan ^{[3
]}

Xiao, Fu ^{[1
]}

机构：

[1] Nanjing Univ Posts & Telecommun, Sch Comp Sci, Nanjing 210023, Peoples R China

[2] Nanjing Univ Posts & Telecommun, Jiangsu Key Lab Big Data Secur & Intelligent Proc, Nanjing 210023, Peoples R China

[3] Jinling Inst Technol, Sch Comp Sci, Nanjing 211169, Peoples R China

来源：

IEEE INTERNET OF THINGS JOURNAL | 2024年 / 11卷 / 04期

基金：

中国国家自然科学基金;

关键词：

Internet of Things; Deep learning; Computational modeling; Knowledge engineering; Complexity theory; Adaptation models; Neural networks; Internet of Things (IoT); lightweight knowledge distillation (KD); malicious traffic detection; AUTOMATIC MODULATION CLASSIFICATION; INTRUSION DETECTION; ATTACK;

D O I：

10.1109/JIOT.2023.3310794

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The purpose of malicious traffic detection and identification in the Internet of Things (IoT) is to detect the intrusion of malicious traffic within the IoT network into IoT devices. Detection and identification play a key role in ensuring the security of the IoT. At this time, great success has been achieved with deep learning in the field of malicious traffic detection and identification. However, due to resource limitations, such as computation weaknesses and low-edge network node storage capacity in the IoT, a high-complexity model based on deep learning cannot be deployed and applied. In this article, we propose a lightweight malicious traffic detection and recognition model named lightweight knowledge distillation space time neural network (LKD-STNN) based on knowledge distillation (KD) deep learning for the IoT. We use KD to build a lightweight student model by depthwise separable convolution and bidirectional long short-term memory (BiLSTM) to realize a lightweight student model and obtain multidimensional characteristic information. According to the characteristics of KD, we propose an adaptive temperature function that can adaptively and dynamically change the temperature during the process of knowledge transfer so that different softening characteristics can be obtained during the training process. Then, the weight is updated by combining loss functions to improve the performance of the student model. The experimental results show that with the publicly available malicious traffic data sets for the IoT, the ToN- IoT and IoT-23, our model not only reduces the complexity of the model and the number of model parameters to less than 1% of the teacher model but also reaches an accuracy of more than 98%, indicating that our model can be applied to the multiclassification identification of malicious traffic in the IoT.

引用

页码：6438 / 6453

页数：16