A Novel Malware Analysis for Malware Detection and Classification Using Machine Learning Algorithms

Submitted by grigby1 on Wed, 06/20/2018 - 12:56pm

Title	A Novel Malware Analysis for Malware Detection and Classification Using Machine Learning Algorithms
Publication Type	Conference Paper
Year of Publication	2017
Authors	Sethi, Kamalakanta, Chaudhary, Shankar Kumar, Tripathy, Bata Krishan, Bera, Padmalochan
Conference Name	Proceedings of the 10th International Conference on Security of Information and Networks
Date Published	October 2017
Publisher	ACM
Conference Location	New York, NY, USA
ISBN Number	978-1-4503-5303-8
Keywords	cuckoo sandbox, Human Behavior, malware classification, malware detection, Metrics, privacy, pubcrawl, resilience, Resiliency, SMO, static and dynamic analysis
Abstract	Nowadays, Malware has become a serious threat to the digitization of the world due to the emergence of various new and complex malware every day. Due to this, the traditional signature-based methods for detection of malware effectively becomes an obsolete method. The efficiency of the machine learning model in context to the detection of malware files has been proved by different researches and studies. In this paper, a framework has been developed to detect and classify different files (e.g exe, pdf, php, etc.) as benign and malicious using two level classifier namely, Macro (for detection of malware) and Micro (for classification of malware files as a Trojan, Spyware, Adware, etc.). Cuckoo Sandbox is used for generating static and dynamic analysis report by executing files in the virtual environment. In addition, a novel model is developed for extracting features based on static, behavioral and network analysis using analysis report generated by the Cuckoo Sandbox. Weka Framework is used to develop machine learning models by using training datasets. The experimental results using proposed framework shows high detection rate with an accuracy of 100% using J48 Decision tree model, 99% using SMO (Sequential Minimal Optimization) and 97% using Random Forest tree. It also shows effective classification rate with accuracy 100% using J48 Decision tree, 91% using SMO and 66% using Random Forest tree. These results are used for detecting and classifying unknown files as benign or malicious.
URL	https://dl.acm.org/doi/10.1145/3136825.3136883
DOI	10.1145/3136825.3136883
Citation Key	sethi_novel_2017

Groups:

Science of Security VO