A Novel Malware Analysis for Malware Detection and Classification Using Machine Learning Algorithms
Title | A Novel Malware Analysis for Malware Detection and Classification Using Machine Learning Algorithms |
Publication Type | Conference Paper |
Year of Publication | 2017 |
Authors | Sethi, Kamalakanta, Chaudhary, Shankar Kumar, Tripathy, Bata Krishan, Bera, Padmalochan |
Conference Name | Proceedings of the 10th International Conference on Security of Information and Networks |
Date Published | October 2017 |
Publisher | ACM |
Conference Location | New York, NY, USA |
ISBN Number | 978-1-4503-5303-8 |
Keywords | cuckoo sandbox, Human Behavior, malware classification, malware detection, Metrics, privacy, pubcrawl, resilience, Resiliency, SMO, static and dynamic analysis |
Abstract | Nowadays, Malware has become a serious threat to the digitization of the world due to the emergence of various new and complex malware every day. Due to this, the traditional signature-based methods for detection of malware effectively becomes an obsolete method. The efficiency of the machine learning model in context to the detection of malware files has been proved by different researches and studies. In this paper, a framework has been developed to detect and classify different files (e.g exe, pdf, php, etc.) as benign and malicious using two level classifier namely, Macro (for detection of malware) and Micro (for classification of malware files as a Trojan, Spyware, Adware, etc.). Cuckoo Sandbox is used for generating static and dynamic analysis report by executing files in the virtual environment. In addition, a novel model is developed for extracting features based on static, behavioral and network analysis using analysis report generated by the Cuckoo Sandbox. Weka Framework is used to develop machine learning models by using training datasets. The experimental results using proposed framework shows high detection rate with an accuracy of 100% using J48 Decision tree model, 99% using SMO (Sequential Minimal Optimization) and 97% using Random Forest tree. It also shows effective classification rate with accuracy 100% using J48 Decision tree, 91% using SMO and 66% using Random Forest tree. These results are used for detecting and classifying unknown files as benign or malicious. |
URL | https://dl.acm.org/doi/10.1145/3136825.3136883 |
DOI | 10.1145/3136825.3136883 |
Citation Key | sethi_novel_2017 |