Text Analysis in Adversarial Settings: Does Deception Leave a Stylistic Trace?

Submitted by grigby1 on Mon, 01/27/2020 - 12:29pm

Title	Text Analysis in Adversarial Settings: Does Deception Leave a Stylistic Trace?
Publication Type	Journal Article
Year of Publication	2019
Authors	Gröndahl, Tommi, Asokan, N.
Journal	ACM Computing Surveys (CSUR)
Volume	52
Pagination	45:1-45:36
Date Published	June 2019
ISSN	0360-0300
Keywords	author identification, deanonymization, deception, Human Behavior, human factors, Metrics, pubcrawl, stylometry, text obfuscation
Abstract	Textual deception constitutes a major problem for online security. Many studies have argued that deceptiveness leaves traces in writing style, which could be detected using text classification techniques. By conducting an extensive literature review of existing empirical work, we demonstrate that while certain linguistic features have been indicative of deception in certain corpora, they fail to generalize across divergent semantic domains. We suggest that deceptiveness as such leaves no content-invariant stylistic trace, and textual similarity measures provide a superior means of classifying texts as potentially deceptive. Additionally, we discuss forms of deception beyond semantic content, focusing on hiding author identity by writing style obfuscation. Surveying the literature on both author identification and obfuscation techniques, we conclude that current style transformation methods fail to achieve reliable obfuscation while simultaneously ensuring semantic faithfulness to the original text. We propose that future work in style transformation should pay particular attention to disallowing semantically drastic changes.
URL	https://dl.acm.org/doi/10.1145/3310331
DOI	10.1145/3310331
Citation Key	grondahl_text_2019

Groups:

Science of Security VO