Improving the effectiveness and efficiency of dynamic malware analysis with machine learning
Title | Improving the effectiveness and efficiency of dynamic malware analysis with machine learning |
Publication Type | Conference Paper |
Year of Publication | 2017 |
Authors | Kilgallon, S., Rosa, L. De La, Cavazos, J. |
Conference Name | 2017 Resilience Week (RWS) |
Date Published | Sept. 2017 |
Publisher | IEEE |
ISBN Number | 978-1-5090-6055-9 |
Keywords | Analytical models, computer security, cybersecurity, data mining, dynamic malware, Dynamic Malware Analysis, feature extraction, Human Behavior, information extraction, invasive software, learning (artificial intelligence), machine learning, Malware, malware analysis, malware classification, malware detection, Metrics, pattern classification, Predictive models, privacy, pubcrawl, Resiliency, sandbox environment, static analysis |
Abstract | As the malware threat landscape is constantly evolving and over one million new malware strains are being generated every day [1], early automatic detection of threats constitutes a top priority of cybersecurity research, and amplifies the need for more advanced detection and classification methods that are effective and efficient. In this paper, we present the application of machine learning algorithms to predict the length of time malware should be executed in a sandbox to reveal its malicious intent. We also introduce a novel hybrid approach to malware classification based on static binary analysis and dynamic analysis of malware. Static analysis extracts information from a binary file without executing it, and dynamic analysis captures the behavior of malware in a sandbox environment. Our experimental results show that by turning the aforementioned problems into machine learning problems, it is possible to get an accuracy of up to 90% on the prediction of the malware analysis run time and up to 92% on the classification of malware families. |
URL | http://ieeexplore.ieee.org/document/8088644/ |
DOI | 10.1109/RWEEK.2017.8088644 |
Citation Key | kilgallon_improving_2017 |
- malware
- static analysis
- sandbox environment
- Resiliency
- pubcrawl
- privacy
- Predictive models
- pattern classification
- Metrics
- malware detection
- malware classification
- Malware Analysis
- Analytical models
- machine learning
- learning (artificial intelligence)
- invasive software
- information extraction
- Human behavior
- feature extraction
- Dynamic Malware Analysis
- dynamic malware
- Data mining
- Cybersecurity
- computer security