Malware Classification with Deep Convolutional Neural Networks
Title | Malware Classification with Deep Convolutional Neural Networks |
Publication Type | Conference Paper |
Year of Publication | 2018 |
Authors | Kalash, M., Rochan, M., Mohammed, N., Bruce, N. D. B., Wang, Y., Iqbal, F. |
Conference Name | 2018 9th IFIP International Conference on New Technologies, Mobility and Security (NTMS) |
Date Published | Feb. 2018 |
Publisher | IEEE |
ISBN Number | 979-11-88428-01-4 |
Keywords | challenging malware classification datasets, CNN, Computer architecture, convolution, convolutional neural networks, deep convolutional neural networks, Deep Learning, deep learning approach, deep learning framework, feedforward neural nets, Gray-scale, grayscale images, Human Behavior, image classification, invasive software, learning (artificial intelligence), Learning systems, machine learning, machine learning approaches, Malimg malware, Malware, malware binaries, malware classification, Metrics, Microsoft malware, privacy, pubcrawl, resilience, Resiliency, Support vector machines |
Abstract | In this paper, we propose a deep learning framework for malware classification. There has been a huge increase in the volume of malware in recent years which poses a serious security threat to financial institutions, businesses and individuals. In order to combat the proliferation of malware, new strategies are essential to quickly identify and classify malware samples so that their behavior can be analyzed. Machine learning approaches are becoming popular for classifying malware, however, most of the existing machine learning methods for malware classification use shallow learning algorithms (e.g. SVM). Recently, Convolutional Neural Networks (CNN), a deep learning approach, have shown superior performance compared to traditional learning algorithms, especially in tasks such as image classification. Motivated by this success, we propose a CNN-based architecture to classify malware samples. We convert malware binaries to grayscale images and subsequently train a CNN for classification. Experiments on two challenging malware classification datasets, Malimg and Microsoft malware, demonstrate that our method achieves better than the state-of-the-art performance. The proposed method achieves 98.52% and 99.97% accuracy on the Malimg and Microsoft datasets respectively. |
URL | https://ieeexplore.ieee.org/document/8323640 |
DOI | 10.1109/NTMS.2018.8328749 |
Citation Key | kalash_malware_2018 |
- learning (artificial intelligence)
- Support vector machines
- Resiliency
- resilience
- pubcrawl
- privacy
- Microsoft malware
- Metrics
- malware classification
- malware binaries
- malware
- Malimg malware
- machine learning approaches
- machine learning
- Learning systems
- challenging malware classification datasets
- invasive software
- image classification
- Human behavior
- grayscale images
- Gray-scale
- feedforward neural nets
- deep learning framework
- deep learning approach
- deep learning
- deep convolutional neural networks
- convolutional neural networks
- convolution
- computer architecture
- CNN