Nomen Est Omen - The Role of Signatures in Ascribing Email Author Identity with Transformer Neural Networks

Submitted by grigby1 on Tue, 04/19/2022 - 1:16pm

Title	Nomen Est Omen - The Role of Signatures in Ascribing Email Author Identity with Transformer Neural Networks
Publication Type	Conference Paper
Year of Publication	2021
Authors	Srinivasan, Sudarshan; Begoli, Edmon; Mahbub, Maria; Knight, Kathryn
Conference Name	2021 IEEE Security and Privacy Workshops (SPW)
Date Published	May
Keywords	adversarial perturbation, attention-based models, authorship attribution, digital forensics, Forensics, natural language processing, Natural languages, Neural networks, Perturbation methods, privacy, pubcrawl, resilience, Resiliency, Scalability, Sensitivity, signature based defense, Training, transformer-based networks
Abstract	Authorship attribution, an NLP problem where anonymous text is matched to its author, has important, cross-disciplinary applications, particularly those concerning cyber-defense. Our research examines the degree of sensitivity that attention-based models have to adversarial perturbations. We ask, what is the minimal amount of change necessary to maximally confuse a transformer model? In our investigation we examine a balanced subset of emails from the Enron email dataset, calculating the performance of our model before and after email signatures have been perturbed. Results show that the model's performance changed significantly in the absence of a signature, indicating the importance of email signatures in email authorship detection. Furthermore, we show that these models rely on signatures for shorter emails much more than for longer emails. We also indicate that additional research is necessary to investigate stylometric features and adversarial training to further improve classification model robustness.
DOI	10.1109/SPW53761.2021.00049
Citation Key	srinivasan_nomen_2021
