Visible to the public On the Generality of Facial Forgery Detection

TitleOn the Generality of Facial Forgery Detection
Publication TypeConference Paper
Year of Publication2019
AuthorsBrockschmidt, J., Shang, J., Wu, J.
Conference Name2019 IEEE 16th International Conference on Mobile Ad Hoc and Sensor Systems Workshops (MASSW)
Date Publishednov
KeywordsCNN, CNN architectures, Computer architecture, convolutional neural nets, convolutional neural networks, DeepFake, Deepfakes, detection architectures, Face, face recognition, Face2Face, FaceSwap, facial forgery detection, feature extraction, Forgery, GANnotation, Human Behavior, human factors, ICface, image forgery detection, MesoNet, Metrics, multiple spoofing techniques, object detection, pubcrawl, resilience, Resiliency, Scalability, Streaming media, Task Analysis, Tools, unseen spoofing techniques, unseen techniques, video streaming, X2Face, XceptionNet
AbstractA variety of architectures have been designed or repurposed for the task of facial forgery detection. While many of these designs have seen great success, they largely fail to address challenges these models may face in practice. A major challenge is posed by generality, wherein models must be prepared to perform in a variety of domains. In this paper, we investigate the ability of state-of-the-art facial forgery detection architectures to generalize. We first propose two criteria for generality: reliably detecting multiple spoofing techniques and reliably detecting unseen spoofing techniques. We then devise experiments which measure how a given architecture performs against these criteria. Our analysis focuses on two state-of-the-art facial forgery detection architectures, MesoNet and XceptionNet, both being convolutional neural networks (CNNs). Our experiments use samples from six state-of-the-art facial forgery techniques: Deepfakes, Face2Face, FaceSwap, GANnotation, ICface, and X2Face. We find MesoNet and XceptionNet show potential to generalize to multiple spoofing techniques but with a slight trade-off in accuracy, and largely fail against unseen techniques. We loosely extrapolate these results to similar CNN architectures and emphasize the need for better architectures to meet the challenges of generality.
DOI10.1109/MASSW.2019.00015
Citation Keybrockschmidt_generality_2019