Title | Nomen Est Omen - The Role of Signatures in Ascribing Email Author Identity with Transformer Neural Networks |
Publication Type | Conference Paper |
Year of Publication | 2021 |
Authors | Srinivasan, Sudarshan; Begoli, Edmon; Mahbub, Maria; Knight, Kathryn |
Conference Name | 2021 IEEE Security and Privacy Workshops (SPW) |
Date Published | May |
Keywords | adversarial perturbation, attention-based models, authorship attribution, digital forensics, Forensics, natural language processing, Natural languages, Neural networks, Perturbation methods, privacy, pubcrawl, resilience, Resiliency, Scalability, Sensitivity, signature based defense, Training, transformer-based networks |
Abstract | Authorship attribution, an NLP problem where anonymous text is matched to its author, has important, cross-disciplinary applications, particularly those concerning cyber-defense. Our research examines the degree of sensitivity that attention-based models have to adversarial perturbations. We ask, what is the minimal amount of change necessary to maximally confuse a transformer model? In our investigation we examine a balanced subset of emails from the Enron email dataset, calculating the performance of our model before and after email signatures have been perturbed. Results show that the model's performance changed significantly in the absence of a signature, indicating the importance of email signatures in email authorship detection. Furthermore, we show that these models rely on signatures for shorter emails much more than for longer emails. We also indicate that additional research is necessary to investigate stylometric features and adversarial training to further improve classification model robustness. |
DOI | 10.1109/SPW53761.2021.00049 |
Citation Key | srinivasan_nomen_2021 |