Visible to the public Malware classification using static analysis based features

TitleMalware classification using static analysis based features
Publication TypeConference Paper
Year of Publication2017
AuthorsHassen, M., Carvalho, M. M., Chan, P. K.
Conference Name2017 IEEE Symposium Series on Computational Intelligence (SSCI)
Date Publishednov
Keywordsanti-virus vendors, computer viruses, control statement shingling, Decision trees, feature extraction, Human Behavior, learning (artificial intelligence), machine learning algorithms, machine learning features, Malware, malware binaries, malware classification, Metrics, ordinary opcode n-gram based features, pattern classification, privacy, program diagnostics, pubcrawl, resilience, Resiliency, static analysis, Training, Vegetation
Abstract

Anti-virus vendors receive hundreds of thousands of malware to be analysed each day. Some are new malware while others are variations or evolutions of existing malware. Because analyzing each malware sample by hand is impossible, automated techniques to analyse and categorize incoming samples are needed. In this work, we explore various machine learning features extracted from malware samples through static analysis for classification of malware binaries into already known malware families. We present a new feature based on control statement shingling that has a comparable accuracy to ordinary opcode n-gram based features while requiring smaller dimensions. This, in turn, results in a shorter training time.

URLhttps://ieeexplore.ieee.org/document/8285426/
DOI10.1109/SSCI.2017.8285426
Citation Keyhassen_malware_2017