Language-Agnostic Injection Detection

Submitted by grigby1 on Mon, 11/29/2021 - 2:28pm

Title	Language-Agnostic Injection Detection
Publication Type	Conference Paper
Year of Publication	2020
Authors	Hermerschmidt, Lars, Straub, Andreas, Piskachev, Goran
Conference Name	2020 IEEE Security and Privacy Workshops (SPW)
Date Published	may
Keywords	composability, Conferences, data mining, formal languages, fuzzing, Metrics, privacy, pubcrawl, security, Software systems, taint analysis
Abstract	Formal languages are ubiquitous wherever software systems need to exchange or store data. Unparsing into and parsing from such languages is an error-prone process that has spawned an entire class of security vulnerabilities. There has been ample research into finding vulnerabilities on the parser side, but outside of language specific approaches, few techniques targeting unparser vulnerabilities exist. This work presents a language-agnostic approach for spotting injection vulnerabilities in unparsers. It achieves this by mining unparse trees using dynamic taint analysis to extract language keywords, which are leveraged for guided fuzzing. Vulnerabilities can thus be found without requiring prior knowledge about the formal language, and in fact, the approach is even applicable where no specification thereof exists at all. This empowers security researchers and developers alike to gain deeper understanding of unparser implementations through examination of the unparse trees generated by the approach, as well as enabling them to find new vulnerabilities in poorly-understood software. This work presents a language-agnostic approach for spotting injection vulnerabilities in unparsers. It achieves this by mining unparse trees using dynamic taint analysis to extract language keywords, which are leveraged for guided fuzzing. Vulnerabilities can thus be found without requiring prior knowledge about the formal language, and in fact, the approach is even applicable where no specification thereof exists at all. This empowers security researchers and developers alike to gain deeper understanding of unparser implementations through examination of the unparse trees generated by the approach, as well as enabling them to find new vulnerabilities in poorly-understood software.
DOI	10.1109/SPW50608.2020.00060
Citation Key	hermerschmidt_language-agnostic_2020

Groups:

Science of Security VO