Visible to the public Deep Android Malware Classification with API-Based Feature Graph

TitleDeep Android Malware Classification with API-Based Feature Graph
Publication TypeConference Paper
Year of Publication2019
AuthorsHuang, N., Xu, M., Zheng, N., Qiao, T., Choo, K. R.
Conference Name2019 18th IEEE International Conference On Trust, Security And Privacy In Computing And Communications/13th IEEE International Conference On Big Data Science And Engineering (TrustCom/BigDataSE)
Date PublishedAug. 2019
PublisherIEEE
ISBN Number978-1-7281-2777-4
KeywordsAnalytical models, Android (operating system), Android malware, Android malware apps, Android malware classification, API features, API-based feature graph classification, application program interfaces, CNN-based classifier, Deep Learning, feature extraction, feature selection, graph theory, hand-refined API-based feature graph, Human Behavior, invasive software, Malware, malware analysis, malware apps, malware class, Mathematical model, Metrics, mobile computing, privacy, pubcrawl, resilience, Resiliency, security, Silicon, structure analysis
Abstract

The rapid growth of Android malware apps poses a great security threat to users thus it is very important and urgent to detect Android malware effectively. What's more, the increasing unknown malware and evasion technique also call for novel detection method. In this paper, we focus on API feature and develop a novel method to detect Android malware. First, we propose a novel selection method for API feature related with the malware class. However, such API also has a legitimate use in benign app thus causing FP problem (misclassify benign as malware). Second, we further explore structure relationships between these APIs and map to a matrix interpreted as the hand-refined API-based feature graph. Third, a CNN-based classifier is developed for the API-based feature graph classification. Evaluations of a real-world dataset containing 3,697 malware apps and 3,312 benign apps demonstrate that selected API feature is effective for Android malware classification, just top 20 APIs can achieve high F1 of 94.3% under Random Forest classifier. When the available API features are few, classification performance including FPR indicator can achieve effective improvement effectively by complementing our further work.

URLhttps://ieeexplore.ieee.org/document/8887380
DOI10.1109/TrustCom/BigDataSE.2019.00047
Citation Keyhuang_deep_2019