Visible to the public Delving in the loss landscape to embed robust watermarks into neural networks

TitleDelving in the loss landscape to embed robust watermarks into neural networks
Publication TypeConference Paper
Year of Publication2021
AuthorsTartaglione, Enzo, Grangetto, Marco, Cavagnino, Davide, Botta, Marco
Conference Name2020 25th International Conference on Pattern Recognition (ICPR)
Date Publishedjan
KeywordsAdaptation models, Artificial neural networks, Human Behavior, pattern locks, pubcrawl, Redundancy, resilience, Resiliency, Scalability, Sensitivity, Shape, Training, Watermarking
AbstractIn the last decade the use of artificial neural networks (ANNs) in many fields like image processing or speech recognition has become a common practice because of their effectiveness to solve complex tasks. However, in such a rush, very little attention has been paid to security aspects. In this work we explore the possibility to embed a watermark into the ANN parameters. We exploit model redundancy and adaptation capacity to lock a subset of its parameters to carry the watermark sequence. The watermark can be extracted in a simple way to claim copyright on models but can be very easily attacked with model fine-tuning. To tackle this culprit we devise a novel watermark aware training strategy. We aim at delving into the loss landscape to find an optimal configuration of the parameters such that we are robust to fine-tuning attacks towards the watermarked parameters. Our experimental results on classical ANN models trained on well-known MNIST and CIFAR-10 datasets show that the proposed approach makes the embedded watermark robust to fine-tuning and compression attacks.
DOI10.1109/ICPR48806.2021.9413062
Citation Keytartaglione_delving_2021