Visible to the public Embedding Fuzzy Rules with YARA Rules for Performance Optimisation of Malware Analysis

TitleEmbedding Fuzzy Rules with YARA Rules for Performance Optimisation of Malware Analysis
Publication TypeConference Paper
Year of Publication2020
AuthorsNaik, N., Jenkins, P., Savage, N., Yang, L., Naik, K., Song, J.
Conference Name2020 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE)
Date Publishedjul
Keywordscryptography, Fuzzy Cryptography, Fuzzy Hashing, Fuzzy logic, fuzzy rules, fuzzy set theory, Fuzzy sets, invasive software, Malware, malware analysis, Metrics, optimisation, Optimization, Pattern matching, Performance Optimisation, pubcrawl, ransomware, Resiliency, rule optimisation process, Scalability, Standards, YARA rule conditions, YARA rules
AbstractYARA rules utilises string or pattern matching to perform malware analysis and is one of the most effective methods in use today. However, its effectiveness is dependent on the quality and quantity of YARA rules employed in the analysis. This can be managed through the rule optimisation process, although, this may not necessarily guarantee effective utilisation of YARA rules and its generated findings during its execution phase, as the main focus of YARA rules is in determining whether to trigger a rule or not, for a suspect sample after examining its rule condition. YARA rule conditions are Boolean expressions, mostly focused on the binary outcome of the malware analysis, which may limit the optimised use of YARA rules and its findings despite generating significant information during the execution phase. Therefore, this paper proposes embedding fuzzy rules with YARA rules to optimise its performance during the execution phase. Fuzzy rules can manage imprecise and incomplete data and encompass a broad range of conditions, which may not be possible in Boolean logic. This embedding may be more advantageous when the YARA rules become more complex, resulting in multiple complex conditions, which may not be processed efficiently utilising Boolean expressions alone, thus compromising effective decision-making. This proposed embedded approach is applied on a collected malware corpus and is tested against the standard and enhanced YARA rules to demonstrate its success.
DOI10.1109/FUZZ48607.2020.9177856
Citation Keynaik_embedding_2020