Delving in the loss landscape to embed robust watermarks into neural networks

Submitted by grigby1 on Fri, 07/29/2022 - 9:45am

Title	Delving in the loss landscape to embed robust watermarks into neural networks
Publication Type	Conference Paper
Year of Publication	2021
Authors	Tartaglione, Enzo, Grangetto, Marco, Cavagnino, Davide, Botta, Marco
Conference Name	2020 25th International Conference on Pattern Recognition (ICPR)
Date Published	jan
Keywords	Adaptation models, Artificial neural networks, Human Behavior, pattern locks, pubcrawl, Redundancy, resilience, Resiliency, Scalability, Sensitivity, Shape, Training, Watermarking
Abstract	In the last decade the use of artificial neural networks (ANNs) in many fields like image processing or speech recognition has become a common practice because of their effectiveness to solve complex tasks. However, in such a rush, very little attention has been paid to security aspects. In this work we explore the possibility to embed a watermark into the ANN parameters. We exploit model redundancy and adaptation capacity to lock a subset of its parameters to carry the watermark sequence. The watermark can be extracted in a simple way to claim copyright on models but can be very easily attacked with model fine-tuning. To tackle this culprit we devise a novel watermark aware training strategy. We aim at delving into the loss landscape to find an optimal configuration of the parameters such that we are robust to fine-tuning attacks towards the watermarked parameters. Our experimental results on classical ANN models trained on well-known MNIST and CIFAR-10 datasets show that the proposed approach makes the embedded watermark robust to fine-tuning and compression attacks.
DOI	10.1109/ICPR48806.2021.9413062
Citation Key	tartaglione_delving_2021

Groups:

Science of Security VO