Detecting Malicious PowerShell Commands Using Deep Neural Networks

Submitted by aekwall on Wed, 01/16/2019 - 2:08pm

Title	Detecting Malicious PowerShell Commands Using Deep Neural Networks
Publication Type	Conference Paper
Year of Publication	2018
Authors	Hendler, Danny, Kels, Shay, Rubin, Amir
Conference Name	Proceedings of the 2018 on Asia Conference on Computer and Communications Security
Publisher	ACM
Conference Location	New York, NY, USA
ISBN Number	978-1-4503-5576-6
Keywords	composability, Deep Learning, malware detection, Metrics, natural language processing, Neural networks, powershell, pubcrawl, Resiliency, Security by Default, Windows Operating System Security
Abstract	Microsoft's PowerShell is a command-line shell and scripting language that is installed by default on Windows machines. Based on Microsoft's .NET framework, it includes an interface that allows programmers to access operating system services. While PowerShell can be configured by administrators for restricting access and reducing vulnerabilities, these restrictions can be bypassed. Moreover, PowerShell commands can be easily generated dynamically, executed from memory, encoded and obfuscated, thus making the logging and forensic analysis of code executed by PowerShell challenging. For all these reasons, PowerShell is increasingly used by cybercriminals as part of their attacks' tool chain, mainly for downloading malicious contents and for lateral movement. Indeed, a recent comprehensive technical report by Symantec dedicated to PowerShell's abuse by cybercrimials [52] reported on a sharp increase in the number of malicious PowerShell samples they received and in the number of penetration tools and frameworks that use PowerShell. This highlights the urgent need of developing effective methods for detecting malicious PowerShell commands. In this work, we address this challenge by implementing several novel detectors of malicious PowerShell commands and evaluating their performance. We implemented both "traditional" natural language processing (NLP) based detectors and detectors based on character-level convolutional neural networks (CNNs). Detectors' performance was evaluated using a large real-world dataset. Our evaluation results show that, although our detectors (and especially the traditional NLP-based ones) individually yield high performance, an ensemble detector that combines an NLP-based classifier with a CNN-based classifier provides the best performance, since the latter classifier is able to detect malicious commands that succeed in evading the former. Our analysis of these evasive commands reveals that some obfuscation patterns automatically detected by the CNN classifier are intrinsically difficult to detect using the NLP techniques we applied. Our detectors provide high recall values while maintaining a very low false positive rate, making us cautiously optimistic that they can be of practical value.
URL	http://doi.acm.org/10.1145/3196494.3196511
DOI	10.1145/3196494.3196511
Citation Key	hendler_detecting_2018

Groups:

Science of Security VO