On the Generality of Facial Forgery Detection

Submitted by grigby1 on Fri, 01/15/2021 - 11:14am

Title	On the Generality of Facial Forgery Detection
Publication Type	Conference Paper
Year of Publication	2019
Authors	Brockschmidt, J., Shang, J., Wu, J.
Conference Name	2019 IEEE 16th International Conference on Mobile Ad Hoc and Sensor Systems Workshops (MASSW)
Date Published	nov
Keywords	CNN, CNN architectures, Computer architecture, convolutional neural nets, convolutional neural networks, DeepFake, Deepfakes, detection architectures, Face, face recognition, Face2Face, FaceSwap, facial forgery detection, feature extraction, Forgery, GANnotation, Human Behavior, human factors, ICface, image forgery detection, MesoNet, Metrics, multiple spoofing techniques, object detection, pubcrawl, resilience, Resiliency, Scalability, Streaming media, Task Analysis, Tools, unseen spoofing techniques, unseen techniques, video streaming, X2Face, XceptionNet
Abstract	A variety of architectures have been designed or repurposed for the task of facial forgery detection. While many of these designs have seen great success, they largely fail to address challenges these models may face in practice. A major challenge is posed by generality, wherein models must be prepared to perform in a variety of domains. In this paper, we investigate the ability of state-of-the-art facial forgery detection architectures to generalize. We first propose two criteria for generality: reliably detecting multiple spoofing techniques and reliably detecting unseen spoofing techniques. We then devise experiments which measure how a given architecture performs against these criteria. Our analysis focuses on two state-of-the-art facial forgery detection architectures, MesoNet and XceptionNet, both being convolutional neural networks (CNNs). Our experiments use samples from six state-of-the-art facial forgery techniques: Deepfakes, Face2Face, FaceSwap, GANnotation, ICface, and X2Face. We find MesoNet and XceptionNet show potential to generalize to multiple spoofing techniques but with a slight trade-off in accuracy, and largely fail against unseen techniques. We loosely extrapolate these results to similar CNN architectures and emphasize the need for better architectures to meet the challenges of generality.
DOI	10.1109/MASSW.2019.00015
Citation Key	brockschmidt_generality_2019

Groups:

Science of Security VO