Automatic Ransomware Detection and Analysis Based on Dynamic API Calls Flow Graph
Title | Automatic Ransomware Detection and Analysis Based on Dynamic API Calls Flow Graph |
Publication Type | Conference Paper |
Year of Publication | 2017 |
Authors | Chen, Zhi-Guo, Kang, Ho-Seok, Yin, Shang-Nan, Kim, Sung-Ryul |
Conference Name | Proceedings of the International Conference on Research in Adaptive and Convergent Systems |
Publisher | ACM |
Conference Location | New York, NY, USA |
ISBN Number | 978-1-4503-5027-3 |
Keywords | API CFG (calls flow graph), composability, data mining, dynamic analysis, Metrics, pubcrawl, ransomware, ransomware detection, resilience, Resiliency |
Abstract | In recent cyber incidents, Ransom software (ransomware) causes a major threat to the security of computer systems. Consequently, ransomware detection has become a hot topic in computer security. Unfortunately, current signature-based and static detection model is often easily evadable by obfuscation, polymorphism, compress, and encryption. For overcoming the lack of signature-based and static ransomware detection approach, we have proposed the dynamic ransomware detection system using data mining techniques such as Random Forest (RF), Support Vector Machine (SVM), Simple Logistic (SL) and Naive Bayes (NB) algorithms for detecting known and unknown ransomware. We monitor the actual (dynamic) behaviors of software to generate API calls flow graphs (CFG) and transfer it in a feature space. Thereafter, data normalization and feature selection were applied to select informative features which are the best for discriminating between various categories of software and benign software. Finally, the data mining algorithms were used for building the detection model for judging whether the software is benign software or ransomware. Our experimental results show that our proposed system can be more effective to improve the performance for ransomware detection. Especially, the accuracy and detection rate of our proposed system with Simple Logistic (SL) algorithm can achieve to 98.2% and 97.6%, respectively. Meanwhile, the false positive rate also can be reduced to 1.2%. |
URL | https://dl.acm.org/citation.cfm?doid=3129676.3129704 |
DOI | 10.1145/3129676.3129704 |
Citation Key | chen_automatic_2017 |