Detecting Malicious PowerShell Commands using Deep Neural Networks

被引：42

作者：

Hendler, Danny ^{[1
]}

Kels, Shay ^{[2
]}

Rubin, Amir ^{[1
]}

机构：

[1] Ben Gurion Univ Negev, Beer Sheva, Israel

[2] Microsoft, Herzliyya, Israel

来源：

PROCEEDINGS OF THE 2018 ACM ASIA CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY (ASIACCS'18) | 2018年

关键词：

PowerShell; malware detection; neural networks; natural language processing; deep learning;

D O I：

10.1145/3196494.3196511

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Microsoft's PowerShell is a command-line shell and scripting language that is installed by default on Windows machines. Based on Microsoft's.NET framework, it includes an interface that allows programmers to access operating system services. While PowerShell can be configured by administrators for restricting access and reducing vulnerabilities, these restrictions can be bypassed. Moreover, PowerShell commands can be easily generated dynamically, executed from memory, encoded and obfuscated, thus making the logging and forensic analysis of code executed by PowerShell challenging. For all these reasons, PowerShell is increasingly used by cybercriminals as part of their attacks' tool chain, mainly for downloading malicious contents and for lateral movement. Indeed, a recent comprehensive technical report by Symantec dedicated to PowerShell's abuse by cybercrimials [52] reported on a sharp increase in the number of malicious PowerShell samples they received and in the number of penetration tools and frameworks that use PowerShell. This highlights the urgent need of developing effective methods for detecting malicious PowerShell commands. In this work, we address this challenge by implementing several novel detectors of malicious PowerShell commands and evaluating their performance. We implemented both "traditional" natural language processing (NLP) based detectors and detectors based on character-level convolutional neural networks (CNNs). Detectors' performance was evaluated using a large real-world dataset. Our evaluation results show that, although our detectors (and especially the traditional NLP-based ones) individually yield high performance, an ensemble detector that combines an NLP-based classifier with a CNN -based classifier provides the best performance, since the latter classifier is able to detect malicious commands that succeed in evading the former. Our analysis of these evasive commands reveals that some obfuscation patterns automatically detected by the CNN classifier are intrinsically difficult to detect using the NLP techniques we applied. Our detectors provide high recall values while maintaining a very low false positive rate, making us cautiously optimistic that they can be of practical value.

引用

页码：187 / 197

页数：11

共 57 条

[1]

[Anonymous], P 30 INT FLOR ART IN

[2]

[Anonymous], 1999, FDN STAT NATURAL LAN

[3]

[Anonymous], 2014, P 2014 C EMP METH NA, DOI 10.3115/v1/D14-1003

[4]

[Anonymous], 2013, PREPRINT ARXIV 1308

[5]

[Anonymous], POWERSHELL

[6]

[Anonymous], ANT SCAN INT

[7]

[Anonymous], 1997, ARTIFICIAL NEURAL NE

[8]

[Anonymous], 1982, Competition and Cooperation in Neural Nets, DOI DOI 10.1007/978-3-642-46466-9_18

[9]

[Anonymous], INVOKE OBFUSCATION M

[10]

[Anonymous], ARXIV171103947

← 1 2 3 4 5 6 →