AutoMal: automatic clustering and signature generation for malwares based on the network flow

被引:1
|
作者
Hao, Sun [1 ,2 ]
Wang, Wen [1 ]
Lu, Huabiao [1 ]
Ren, Peige [1 ]
机构
[1] Natl Univ Def Technol, Sch Comp, Changsha, Hunan, Peoples R China
[2] Natl Univ Def Technol, Natl Key Lab Parallel & Distributed Proc, Changsha, Hunan, Peoples R China
基金
美国国家科学基金会; 国家高技术研究发展计划(863计划);
关键词
network malware; clustering; signature generation; matrix; feature hashing;
D O I
10.1002/sec.1029
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The volume of malwares is growing at an exponential speed nowadays. This huge growth makes it extremely hard to analyse malware manually. Most existing signatures extracting methods are based on string signatures, and string matching is not accurate and time consuming. Therefore, this paper presents AutoMal, a system for automatically extracting signatures from large-scale malwares. Firstly, the system proposes to represent the network flows by using feature hashing, which can dramatically reduce the high-dimensional feature spaces that are general in malware analysis. Then, we design a clustering and median filtering method to classify the malware vectors into different types. Finally, it introduces the signature generation algorithm based on Bayesian method. The system can extract both the byte signature and the hash signature of malwares from its network flow with low false positive and zero false negative. Our evaluation shows that AutoMal can generate strongly noise-resisted signatures that exactly depict the characteristics of malware. Copyright (c) 2014 John Wiley & Sons, Ltd.
引用
收藏
页码:1845 / 1854
页数:10
相关论文
共 50 条
  • [31] The information flow clustering model and algorithm based on the Artificial Bee Colony mechanism of PPI network
    Lei, X.-J. (xjlei168@163.com), 1600, Science Press (35): : 134 - 145
  • [32] Knowledge Base Generation Based on Fuzzy Clustering
    T. A. Moiseeva
    T. M. Ledeneva
    Programming and Computer Software, 2023, 49 : S99 - S107
  • [33] Automatic Stop List Generation for Clustering Recognition Results of Call Center Recordings
    Popova, Svetlana
    Krivosheeva, Tatiana
    Korenevsky, Maxim
    SPEECH AND COMPUTER, 2014, 8773 : 137 - 144
  • [34] Knowledge Base Generation Based on Fuzzy Clustering
    Moiseeva, T. A.
    Ledeneva, T. M.
    PROGRAMMING AND COMPUTER SOFTWARE, 2023, 49 (SUPPL 2) : S99 - S107
  • [35] Rapid analysis of metagenomic data using signature-based clustering
    Timothy Chappell
    Shlomo Geva
    James M. Hogan
    Flavia Huygens
    Irani U. Rathnayake
    Stephen Rudd
    Wayne Kelly
    Dimitri Perrin
    BMC Bioinformatics, 19
  • [36] Clustering in mobile ad hoc network based on neural network
    Chen Ai-bin
    Cai Zi-xing
    Hu De-wen
    JOURNAL OF CENTRAL SOUTH UNIVERSITY OF TECHNOLOGY, 2006, 13 (06): : 699 - 702
  • [37] Clustering in mobile ad hoc network based on neural network
    陈爱斌
    蔡自兴
    胡德文
    Journal of Central South University of Technology(English Edition), 2006, (06) : 699 - 702
  • [38] A Novel Scalable Signature Based Subspace Clustering Approach for Big Data
    Gayathri, T.
    Bhaskari, D. Lalitha
    INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY AND WEB ENGINEERING, 2019, 14 (02) : 41 - 51
  • [39] Clustering in mobile ad hoc network based on neural network
    Ai-bin Chen
    Zi-xing Cai
    De-wen Hu
    Journal of Central South University of Technology, 2006, 13 : 699 - 702
  • [40] Rapid analysis of metagenomic data using signature-based clustering
    Chappell, Timothy
    Geva, Shlomo
    Hogan, James M.
    Huygens, Flavia
    Rathnayake, Irani U.
    Rudd, Stephen
    Kelly, Wayne
    Perrin, Dimitri
    BMC BIOINFORMATICS, 2018, 19