SSCL-TransMD: Semi-Supervised Continual Learning Transformer for Malicious Software Detection

Cited by: 1
Authors
Kou, Liang [1, 2]
Zhao, Donghui [2]
Han, Hui [1]
Xu, Xiong [1]
Gong, Shuaige [1]
Wang, Liandong [1]
Affiliations
[1] State Key Lab Complex Electromagnet Environm Effec, Luoyang 471000, Peoples R China
[2] Hangzhou Dianzi Univ, Coll Cyberspace, Hangzhou 310018, Peoples R China
Source
APPLIED SCIENCES-BASEL | 2023 / Vol. 13 / Issue 22
Keywords
Android malware detection; deep learning; transformer; semi-supervised continual learning; MALWARE DETECTION
DOI
10.3390/app132212255
Chinese Library Classification number
O6 [Chemistry]
Discipline classification code
0703
Abstract
Machine learning-based malware (malicious software) detection methods have a wide range of real-world applications. However, these approaches suffer from "model aging": the model's effectiveness decreases rapidly as malware evolves and new variants continuously emerge. Model aging is usually addressed by retraining, which relies on large numbers of labeled samples obtained at great expense. To address this challenge, this paper proposes a semi-supervised continual learning malware detection model based on the Transformer. First, the model improves the lifelong semi-supervised mixture algorithm to dynamically adjust the weighted combination of new and historical sample sequences, addressing the imbalance problem. Second, the Learning with Local and Global Consistency algorithm iteratively computes similarity scores for the unlabeled samples in the mixed samples to obtain pseudo-labels. Finally, a multilayer perceptron is applied for malware classification. To validate the model, the paper conducts experiments on the CICMalDroid2020 dataset. The experimental results show that the proposed model outperforms existing deep learning detection models. In binary classification, the F1 score improves by 1.27% on average compared to other models. After hybrid samples, including historical and new data, are input four times, the F1 score is still 1.96% higher than that of other models.
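The pseudo-labeling step named in the abstract can be illustrated with the standard Learning with Local and Global Consistency (LLGC) iteration, F(t+1) = αSF(t) + (1 − α)Y, where S is the symmetrically normalized affinity matrix and Y holds the known labels. The following is a minimal sketch of that textbook iteration only, not the paper's implementation: the function name `llgc_pseudo_labels`, the Gaussian affinity, the parameters `alpha`, `sigma`, and `iters`, and the toy 2-D points are all assumptions made for illustration.

```python
import math


def llgc_pseudo_labels(points, labels, n_classes, alpha=0.9, sigma=1.0, iters=50):
    """Assign pseudo-labels with the standard LLGC iteration (illustrative sketch).

    points: list of feature tuples; labels: class index, or None if unlabeled.
    """
    n = len(points)
    # Gaussian affinity matrix W with a zero diagonal.
    w = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:
                d2 = sum((a - b) ** 2 for a, b in zip(points[i], points[j]))
                w[i][j] = math.exp(-d2 / (2 * sigma ** 2))
    # Symmetric normalization S = D^{-1/2} W D^{-1/2}.
    deg = [sum(row) for row in w]
    s = [[w[i][j] / math.sqrt(deg[i] * deg[j]) if deg[i] * deg[j] > 0 else 0.0
          for j in range(n)] for i in range(n)]
    # Initial label matrix Y: one-hot rows for labeled samples, zero rows otherwise.
    y = [[1.0 if labels[i] == c else 0.0 for c in range(n_classes)] for i in range(n)]
    f = [row[:] for row in y]
    # Iterate F <- alpha * S * F + (1 - alpha) * Y.
    for _ in range(iters):
        f = [[alpha * sum(s[i][k] * f[k][c] for k in range(n)) + (1 - alpha) * y[i][c]
              for c in range(n_classes)] for i in range(n)]
    # Each sample takes the class with the highest score.
    return [max(range(n_classes), key=lambda c: f[i][c]) for i in range(n)]


# Toy data (hypothetical): two well-separated clusters, one labeled sample each.
points = [(0.0, 0.0), (0.1, 0.0), (0.2, 0.1), (3.0, 3.0), (3.1, 2.9), (2.9, 3.2)]
labels = [0, None, None, 1, None, None]
print(llgc_pseudo_labels(points, labels, n_classes=2))  # [0, 0, 0, 1, 1, 1]
```

In this toy run each unlabeled point inherits the label of the single labeled sample in its cluster. How the paper builds its affinity matrix over Transformer features and mixed sample sequences is not stated in the abstract.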
Pages: 21
Related papers
50 in total
  • [21] Semi-supervised Malicious Domain Detection Based on Meta Pseudo Labeling
    Gao, Yi
    Yuan, Fangfang
    Yang, Jinglin
    Wang, Dakui
    Cao, Cong
    Liu, Yanbing
    COMPUTATIONAL SCIENCE, ICCS 2024, PT II, 2024, 14833 : 312 - 324
  • [22] Software fault localization using semi-supervised learning
    Zheng, Wei
    Wu, Xiaoxue
    Tan, Xin
    Peng, Yaopeng
    Yang, Shuai
    Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University, 2015, 33 (02): 332 - 336
  • [23] FRUGAL: Unlocking Semi-Supervised Learning for Software Analytics
    Tu, Huy
    Menzies, Tim
    2021 36TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING ASE 2021, 2021: 394 - 406
  • [24] Semi-supervised Learning Framework for UAV Detection
    Medaiyese, Olusiji O.
    Ezuma, Martins
    Lauf, Adrian P.
    Guvenc, Ismail
    2021 IEEE 32ND ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2021
  • [25] Semi-Supervised Learning for Cervical Precancer Detection
    Angara, Sandeep
    Guo, Peng
    Xue, Zhiyun
    Antani, Sameer
    2021 IEEE 34TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS (CBMS), 2021: 202 - 206
  • [26] Semi-supervised Anomaly Detection with Reinforcement Learning
    Lee, Changheon
    Kim, JoonKyu
    Kang, Suk-Ju
    2022 37TH INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS AND COMMUNICATIONS (ITC-CSCC 2022), 2022: 933 - 936
  • [27] Semi-Supervised Active Learning for Object Detection
    Chen, Sijin
    Yang, Yingyun
    Hua, Yan
    ELECTRONICS, 2023, 12 (02)
  • [28] Semi-supervised Learning for Unknown Malware Detection
    Santos, Igor
    Nieves, Javier
    Bringas, Pablo G.
    INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE, 2011, 91 : 415 - 422
  • [29] Proposal Learning for Semi-Supervised Object Detection
    Tang, Peng
    Ramaiah, Chetan
    Wang, Yan
    Xu, Ran
    Xiong, Caiming
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021: 2290 - 2300
  • [30] A semi-supervised learning model for intrusion detection
    Jiang, Eric P.
    INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2019, 13 (03): 343 - 353