Visible to the public Scalable and Accurate Binary Code Search Method Based on Simhash and Partial Trace

TitleScalable and Accurate Binary Code Search Method Based on Simhash and Partial Trace
Publication TypeConference Paper
Year of Publication2020
AuthorsZhang, Yunan, Xu, Aidong Xu, Jiang, Yixin
Conference Name2020 IEEE 19th International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom)
Date PublishedJan. 2021
PublisherIEEE
ISBN Number978-1-6654-0392-4
KeywordsBinary Code Search, Binary codes, distortion, Filtering, Malware, Malware Homology Analysisi, Partial Trace, pubcrawl, Scalability, Scalable Security, security, simHash, Syntactics, Tools
Abstract

Binary code search has received much attention recently due to its impactful applications, e.g., plagiarism detection, malware detection and software vulnerability auditing. However, developing an effective binary code search tool is challenging due to the gigantic syntax and structural differences in binaries resulted from different compilers, compiler options and malware family. In this paper, we propose a scalable and accurate binary search engine which performs syntactic matching by combining a set of key techniques to address the challenges above. The key contribution is binary code searching technique which combined function filtering and partial trace method to match the function code relatively quick and accurate. In addition, a simhash and basic information based function filtering is proposed to dramatically reduce the irrelevant target functions. Besides, we introduce a partial trace method for matching the shortlisted function accurately. The experimental results show that our method can find similar functions, even with the presence of program structure distortion, in a scalable manner.

URLhttps://ieeexplore.ieee.org/document/9343136
DOI10.1109/TrustCom50675.2020.00111
Citation Keyzhang_scalable_2020