Polymorphic Malware Detection Using Sequence Classification Methods

被引:39
|
作者
Drew, Jake [1 ]
Moore, Tyler [2 ]
Hahsler, Michael [1 ]
机构
[1] Southern Methodist Univ, Comp Sci & Engn Dept, Dallas, TX 75275 USA
[2] Univ Tulsa, Tandy Sch Comp Sci, Tulsa, OK 74104 USA
关键词
SEARCH;
D O I
10.1109/SPW.2016.30
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Polymorphic malware detection is challenging due to the continual mutations miscreants introduce to successive instances of a particular virus. Such changes are akin to mutations in biological sequences. Recently, high-throughput methods for gene sequence classification have been developed by the bioinformatics and computational biology communities. In this paper, we argue that these methods can be usefully applied to malware detection. Unfortunately, gene classification tools are usually optimized for and restricted to an alphabet of four letters ( nucleic acids). Consequently, we have selected the Strand gene sequence classifier, which offers a robust classification strategy that can easily accommodate unstructured data with any alphabet including source code or compiled machine code. To demonstrate Stand's suitability for classifying malware, we execute it on approximately 500GB of malware data provided by the Kaggle Microsoft Malware Classification Challenge (BIG 2015) used for predicting 9 classes of polymorphic malware. Experiments show that, with minimal adaptation, the method achieves accuracy levels well above 95% requiring only a fraction of the training times used by the winning team's method.
引用
收藏
页码:81 / 87
页数:7
相关论文
共 50 条
  • [1] Polymorphic malware detection using sequence classification methods and ensembles: BioSTAR 2016 Recommended Submission - EURASIP Journal on Information Security
    Drew J.
    Hahsler M.
    Moore T.
    Drew, Jake (jakemdrew@gmail.com), 1600, Springer International Publishing (2017):
  • [2] Polymorphic Malware Detection
    Selamat, Nur Syuhada
    Ali, Fakariah Hani Mohd
    Abu Othman, Noor Ashitah
    2016 6TH INTERNATIONAL CONFERENCE ON IT CONVERGENCE AND SECURITY (ICITCS 2016), 2016, : 274 - 278
  • [3] Advanced Windows Methods on Malware Detection and Classification
    Rabadi, Dima
    Teo, Sin G.
    36TH ANNUAL COMPUTER SECURITY APPLICATIONS CONFERENCE (ACSAC 2020), 2020, : 54 - 68
  • [4] Improvement of malware detection and classification using API call sequence alignment and visualization
    Hyunjoo Kim
    Jonghyun Kim
    Youngsoo Kim
    Ikkyun Kim
    Kuinam J. Kim
    Hyuncheol Kim
    Cluster Computing, 2019, 22 : 921 - 929
  • [5] Improvement of malware detection and classification using API call sequence alignment and visualization
    Kim, Hyunjoo
    Kim, Jonghyun
    Kim, Youngsoo
    Kim, Ikkyun
    Kim, Kuinam J.
    Kim, Hyuncheol
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 1): : 921 - 929
  • [6] Malware Classification using Byte Sequence Information
    Jung, Byungho
    Kim, Taeguen
    Im, Eul Gyu
    PROCEEDINGS OF THE 2018 CONFERENCE ON RESEARCH IN ADAPTIVE AND CONVERGENT SYSTEMS (RACS 2018), 2018, : 143 - 148
  • [7] Malware Detection and Classification Based on Parallel Sequence Comparison
    Ding, Hao
    Sun, Wenjie
    Chen, Yihang
    Zhao, Binglin
    Gui, Hairen
    2018 5TH INTERNATIONAL CONFERENCE ON SYSTEMS AND INFORMATICS (ICSAI), 2018, : 670 - 675
  • [8] Malware Analysis and Classification Using Sequence Alignments
    Cho, In Kyeom
    Kim, Tae Guen
    Shim, Yu Jin
    Ryu, Minsoo
    Im, Eul Gyu
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2016, 22 (03): : 371 - 377
  • [9] Detection and Retrieval of Malware Using Classification
    Makandar, Aziz
    Patrot, Anita
    2017 INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION, CONTROL AND AUTOMATION (ICCUBEA), 2017,
  • [10] Malware Classification Using Deep Learning Methods
    Cakir, Bugra
    Dogdu, Erdogan
    ACMSE '18: PROCEEDINGS OF THE ACMSE 2018 CONFERENCE, 2018,