Deep Android Malware Classification with API-Based Feature Graph

Submitted by grigby1 on Fri, 12/11/2020 - 2:33pm

Title	Deep Android Malware Classification with API-Based Feature Graph
Publication Type	Conference Paper
Year of Publication	2019
Authors	Huang, N., Xu, M., Zheng, N., Qiao, T., Choo, K. R.
Conference Name	2019 18th IEEE International Conference On Trust, Security And Privacy In Computing And Communications/13th IEEE International Conference On Big Data Science And Engineering (TrustCom/BigDataSE)
Date Published	Aug. 2019
Publisher	IEEE
ISBN Number	978-1-7281-2777-4
Keywords	Analytical models, Android (operating system), Android malware, Android malware apps, Android malware classification, API features, API-based feature graph classification, application program interfaces, CNN-based classifier, Deep Learning, feature extraction, feature selection, graph theory, hand-refined API-based feature graph, Human Behavior, invasive software, Malware, malware analysis, malware apps, malware class, Mathematical model, Metrics, mobile computing, privacy, pubcrawl, resilience, Resiliency, security, Silicon, structure analysis
Abstract	The rapid growth of Android malware apps poses a great security threat to users thus it is very important and urgent to detect Android malware effectively. What's more, the increasing unknown malware and evasion technique also call for novel detection method. In this paper, we focus on API feature and develop a novel method to detect Android malware. First, we propose a novel selection method for API feature related with the malware class. However, such API also has a legitimate use in benign app thus causing FP problem (misclassify benign as malware). Second, we further explore structure relationships between these APIs and map to a matrix interpreted as the hand-refined API-based feature graph. Third, a CNN-based classifier is developed for the API-based feature graph classification. Evaluations of a real-world dataset containing 3,697 malware apps and 3,312 benign apps demonstrate that selected API feature is effective for Android malware classification, just top 20 APIs can achieve high F1 of 94.3% under Random Forest classifier. When the available API features are few, classification performance including FPR indicator can achieve effective improvement effectively by complementing our further work.
URL	https://ieeexplore.ieee.org/document/8887380
DOI	10.1109/TrustCom/BigDataSE.2019.00047
Citation Key	huang_deep_2019

Groups:

Science of Security VO