Malware Classification Using Machine Learning Algorithms and Tools
Title | Malware Classification Using Machine Learning Algorithms and Tools |
Publication Type | Conference Paper |
Year of Publication | 2019 |
Authors | Mahajan, Ginika, Saini, Bhavna, Anand, Shivam |
Conference Name | 2019 Second International Conference on Advanced Computational and Communication Paradigms (ICACCP) |
Date Published | Feb. 2019 |
Publisher | IEEE |
ISBN Number | 978-1-5386-7989-0 |
Keywords | Classification algorithms, Cohen's Kappa, comparable features, comparative classification, confusion matrix, emerging malwares, family classification, feature extraction, Human Behavior, invasive software, learning (artificial intelligence), machine learning, machine learning algorithms, Malware, malware classification, malware samples, matrix algebra, Metrics, pattern classification, privacy, pubcrawl, Random Forest, resilience, Resiliency, Support vector machines, Tools |
Abstract | Malware classification is the process of categorizing the families of malware on the basis of their signatures. This work focuses on classifying the emerging malwares on the basis of comparable features of similar malwares. This paper proposes a novel framework that categorizes malware samples into their families and can identify new malware samples for analysis. For this six diverse classification techniques of machine learning are used. To get more comparative and thus accurate classification results, analysis is done using two different tools, named as Knime and Orange. The work proposed can help in identifying and thus cleaning new malwares and classifying malware into their families. The correctness of family classification of malwares is investigated in terms of confusion matrix, accuracy and Cohen's Kappa. After evaluation it is analyzed that Random Forest gives the highest accuracy. |
URL | https://ieeexplore.ieee.org/document/8882965/ |
DOI | 10.1109/ICACCP.2019.8882965 |
Citation Key | mahajan_malware_2019 |
- malware
- tools
- Support vector machines
- Resiliency
- resilience
- Random Forest
- pubcrawl
- privacy
- pattern classification
- Metrics
- matrix algebra
- malware samples
- malware classification
- Classification algorithms
- machine learning algorithms
- machine learning
- learning (artificial intelligence)
- invasive software
- Human behavior
- feature extraction
- family classification
- emerging malwares
- confusion matrix
- comparative classification
- comparable features
- Cohen's Kappa