MAXS: Scaling Malware Execution with Sequential Multi-Hypothesis Testing

Submitted by grigby1 on Fri, 09/15/2017 - 10:10am

Title	MAXS: Scaling Malware Execution with Sequential Multi-Hypothesis Testing
Publication Type	Conference Paper
Year of Publication	2016
Authors	Vadrevu, Phani, Perdisci, Roberto
Conference Name	Proceedings of the 11th ACM on Asia Conference on Computer and Communications Security
Publisher	ACM
Conference Location	New York, NY, USA
ISBN Number	978-1-4503-4233-9
Keywords	bare-metal analysis, Human Behavior, malware analysis, malware classification, malware sandbox, Metrics, privacy, pubcrawl, Resiliency, Sandboxing
Abstract	In an attempt to coerce useful information about the behavior of new malware families, threat analysts commonly force newly collected malicious software samples to run within a sandboxed environment. The main goal is to gather intelligence that can later be leveraged to detect and enumerate new malware infections within a network. Currently, most analysis environments "blindly" execute each newly collected malware sample for a predetermined amount of time (e.g., four to five minutes). However, a large majority of malware samples that are forced through sandbox execution are simply repackaged versions of previously seen (and already analyzed) malware. Consequently, a significant amount of time may be wasted in analyzing samples that do not generate new intelligence. In this paper, we propose MAXS, a novel probabilistic multi-hypothesis testing framework for scaling execution in malware analysis environments, including bare-metal execution environments. Our main goal is to automatically recognize whether a malware sample that is undergoing dynamic analysis has likely been seen before (e.g., in a "differently packed" form), and determine if we could therefore stop its execution early while avoiding loss of valuable malware intelligence (e.g., without missing DNS queries to never-before-seen malware command-and-control domains). We have tested our prototype implementation of MAXS over two large collections of malware execution traces obtained from two distinct production-level analysis environments. Our experimental results show that using MAXS we are able to reduce malware execution time by up to 50% in average, with less than 0.3% information loss. This roughly translates into the ability to double the capacity of malware sandbox environments, thus significantly optimizing the resources dedicated to malware execution and analysis. Our results are particularly important for bare-metal execution environments, in which it is not easy to leverage the economies of scale that characterize virtual-machine or emulation based malware sandboxes. For example, MAXS could be used to significantly cut the cost of bare-metal analysis environments by reducing the hardware resources needed to analyze a predetermined daily number of new malware samples.
URL	http://doi.acm.org/10.1145/2897845.2897873
DOI	10.1145/2897845.2897873
Citation Key	vadrevu_maxs:_2016

Groups:

Science of Security VO