Obfuscated VBA Macro Detection Using Machine Learning

被引:32
作者
Kim, Sangwoo [1 ]
Hong, Seokmyung [1 ]
Oh, Jaesang [1 ]
Lee, Heejo [1 ]
机构
[1] Korea Univ, Seoul, South Korea
来源
2018 48TH ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS (DSN) | 2018年
关键词
!text type='JAVA']JAVA[!/text]SCRIPT;
D O I
10.1109/DSN.2018.00057
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Malware using document files as an attack vector has continued to increase and now constitutes a large portion of phishing attacks. To avoid anti-virus detection, malware writers usually implement obfuscation techniques in their source code. Although obfuscation is related to malicious code detection, little research has been conducted on obfuscation with regards to Visual Basic for Applications (VBA) macros. In this paper, we summarize the obfuscation techniques and propose an obfuscated macro code detection method using five machine learning classifiers. To train these classifiers, our proposed method uses 15 discriminant static features, taking into account the characteristics of the VBA macros. We evaluated our approach using a real-world dataset of obfuscated and non-obfuscated VBA macros extracted from Microsoft Office document files. The experimental results demonstrate that our detection approach achieved a F-2 score improvement of greater than 23% compared to those of related studies.
引用
收藏
页码:490 / 501
页数:12
相关论文
共 43 条
[1]  
Aebersold Simon, 2016, P 11 INT C INT MON P
[2]  
[Anonymous], 1997, Tech. Rep
[3]  
[Anonymous], 1998, LEARNING TEXT CATEGO
[4]  
[Anonymous], MS OFORMS OFF FORMS
[5]  
[Anonymous], 2009, NEURAL NETWORKS LEAR
[6]  
Beek C., 2017, TECH REP
[7]  
Behrisch M, 2012, IEEE CONF VIS ANAL, P209, DOI 10.1109/VAST.2012.6400549
[8]  
Chantry G., 2014, LABS VBA IS DEFINITE
[9]  
Chi D., 2000, TECH REP
[10]  
Choi Y, 2009, LECT NOTES COMPUT SC, V5899, P160, DOI 10.1007/978-3-642-10509-8_19