Detecting Malware with an Ensemble Method Based on Deep Neural Network

被引:111
作者
Yan, Jinpei [1 ]
Qi, Yong [1 ]
Rao, Qifan [1 ]
机构
[1] Xi An Jiao Tong Univ, Dept Comp Sci & Technol, Xian, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1155/2018/7247095
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Malware detection plays a crucial role in computer security. Recent researches mainly use machine learning based methods heavily relying on domain knowledge for manually extracting malicious features. In this paper, we propose MalNet, a novel malware detection method that learns features automatically from the raw data. Concretely, we first generate a grayscale image from malware file, meanwhile extracting its opcode sequences with the decompilation tool IDA. Then MalNet uses CNN and LSTM networks to learn from grayscale image and opcode sequence, respectively, and takes a stacking ensemble for malware classification. We perform experiments on more than 40,000 samples including 20,650 benign files collected from online software providers and 21,736 malwares provided by Microsoft. The evaluation result shows that MalNet achieves 99.88% validation accuracy for malware detection. In addition, we also take malware family classification experiment on 9 malware families to compare MalNet with other related works, in which MalNet outperforms most of related works with 99.36% detection accuracy and achieves a considerable speed-up on detecting efficiency comparing with two state-of-the-art results on Microsoft malware dataset.
引用
收藏
页数:16
相关论文
共 45 条
  • [1] Novel Feature Extraction, Selection and Fusion for Effective Malware Family Classification
    Ahmadi, Mansour
    Ulyanov, Dmitry
    Semenov, Stanislav
    Trofimov, Mikhail
    Giacinto, Giorgio
    [J]. CODASPY'16: PROCEEDINGS OF THE SIXTH ACM CONFERENCE ON DATA AND APPLICATION SECURITY AND PRIVACY, 2016, : 183 - 194
  • [2] [Anonymous], Kaspersky security bulletin 2015
  • [3] Biggio B., 2013, P 2013 EUROPEAN C MA, DOI DOI 10.1007/978-3-642-40994-3_25
  • [4] Burnaev E., ONE CLASS SVM PRIVIL
  • [5] Carlini N., Defensive Distillation is Not Robust to Adversarial Examples
  • [6] Carlini N., ADVERSARIAL EXAMPLES
  • [7] Cohen W. W., 1995, Machine Learning. Proceedings of the Twelfth International Conference on Machine Learning, P115
  • [8] Polymorphic malware detection using sequence classification methods and ensembles: BioSTAR 2016 Recommended Submission - EURASIP Journal on Information Security
    Drew J.
    Hahsler M.
    Moore T.
    [J]. Drew, Jake (jakemdrew@gmail.com), 1600, Springer International Publishing (2017):
  • [9] Garcia F. C., RANDOM FOREST MALWAR
  • [10] Framewise phoneme classification with bidirectional LSTM and other neural network architectures
    Graves, A
    Schmidhuber, J
    [J]. NEURAL NETWORKS, 2005, 18 (5-6) : 602 - 610