Visible to the public DP-CGAN: Differentially Private Synthetic Data and Label Generation

TitleDP-CGAN: Differentially Private Synthetic Data and Label Generation
Publication TypeConference Paper
Year of Publication2019
AuthorsTorkzadehmahani, Reihaneh, Kairouz, Peter, Paten, Benedict
Conference Name2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
Date Publishedjun
KeywordsAI, Data models, data privacy, differentially private conditional GAN training framework, differentially private synthetic data, DP-CGAN, Gallium nitride, GAN models, generative adversarial networks, Generators, Human Behavior, human factors, label generation, learning (artificial intelligence), MNIST dataset, original sensitive datasets, privacy, pubcrawl, Renyi differential privacy accountant, research communities, resilience, Resiliency, Scalability, single-digit epsilon parameter, spent privacy budget, Training, training dataset
AbstractGenerative Adversarial Networks (GANs) are one of the well-known models to generate synthetic data including images, especially for research communities that cannot use original sensitive datasets because they are not publicly accessible. One of the main challenges in this area is to preserve the privacy of individuals who participate in the training of the GAN models. To address this challenge, we introduce a Differentially Private Conditional GAN (DP-CGAN) training framework based on a new clipping and perturbation strategy, which improves the performance of the model while preserving privacy of the training dataset. DP-CGAN generates both synthetic data and corresponding labels and leverages the recently introduced Renyi differential privacy accountant to track the spent privacy budget. The experimental results show that DP-CGAN can generate visually and empirically promising results on the MNIST dataset with a single-digit epsilon parameter in differential privacy.
DOI10.1109/CVPRW.2019.00018
Citation Keytorkzadehmahani_dp-cgan_2019