Dynamic Prototype Network Based on Sample Adaptation for Few-Shot Malware Detection

被引:57
作者
Chai, Yuhan [1 ]
Du, Lei [2 ,3 ]
Qiu, Jing [1 ]
Yin, Lihua [1 ]
Tian, Zhihong [1 ]
机构
[1] Guangzhou Univ, Cyberspace Inst Adv Technol, Guangzhou 510006, Guangdong, Peoples R China
[2] Harbin Inst Technol, Sch Comp Sci & Technol, Shenzhen 518055, Guangdong, Peoples R China
[3] Peng Cheng Lab, Shenzhen 518066, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Malware; Feature extraction; Semantics; Convolutional neural networks; Prototypes; Data models; Deep learning; Feature representation; neural nets; similarity measures; security;
D O I
10.1109/TKDE.2022.3142820
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The continuous increase and spread of malware have caused immeasurable losses to social enterprises and even the country, especially unknown malware. Most existing methods use predefined class samples to train models, which cannot handle unknown malware detection. In this paper, we formalize unknown malware detection as a Few-Shot Learning problem. However, the existing model cannot dynamically adjust the model parameters according to the samples and does not deeply consider the influence of the correlation between samples, so it achieves sub-optimal performance. We propose a Dynamic Prototype Network based on Sample Adaptation for few-shot malware detection (DPNSA). Specifically, we use dynamic convolution to realize dynamic feature extraction based on sample adaptation. Secondly, we define the class feature (prototype) as the mean of the dynamic embedding of all malware samples of each class in the support set. Then, a dual-sample dynamic activation function is proposed, which uses the correlation of the dual-sample to reduce the impact of unrelated features between samples on the metric. Finally, we use the metric-based method to calculate the distance between the query sample and the prototype to realize malware detection. Experiments show that our method outperforms the existing few-shot malware detection models and achieves significant improvement.
引用
收藏
页码:4754 / 4766
页数:13
相关论文
共 56 条
  • [1] Abou-Assaleh T, 2004, P INT COMP SOFTW APP, P41
  • [2] Unsuccessful Story about Few Shot Malware Family Classification and Siamese Network to the Rescue
    Bai, Yude
    Xing, Zhenchang
    Li, Xiaohong
    Feng, Zhiyong
    Ma, Duoyuan
    [J]. 2020 ACM/IEEE 42ND INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2020), 2020, : 1560 - 1571
  • [3] Bhodia N, 2019, Arxiv, DOI arXiv:1903.11551
  • [4] Chai YH, 2020, INT WIREL COMMUN, P463, DOI 10.1109/IWCMC48107.2020.9148289
  • [5] Chen Y., 2020, COMPUTER VISION ECCV, P351
  • [6] Dynamic Convolution: Attention over Convolution Kernels
    Chen, Yinpeng
    Dai, Xiyang
    Liu, Mengchen
    Chen, Dongdong
    Yuan, Lu
    Liu, Zicheng
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 11027 - 11036
  • [7] Malware Classification Using Attention-Based Transductive Learning Network
    Deng, Liting
    Wen, Hui
    Xin, Mingfeng
    Sun, Yue
    Sun, Limin
    Zhu, Hongsong
    [J]. SECURITY AND PRIVACY IN COMMUNICATION NETWORKS (SECURECOMM 2020), PT II, 2020, 336 : 403 - 418
  • [8] Finn C, 2017, PR MACH LEARN RES, V70
  • [9] Gao TY, 2019, AAAI CONF ARTIF INTE, P6407
  • [10] Gao Y, 2020, AAAI CONF ARTIF INTE, V34, P10818