Visible to the public A Non-Parametric Model for Accurate and Provably Private Synthetic Data Sets

TitleA Non-Parametric Model for Accurate and Provably Private Synthetic Data Sets
Publication TypeConference Paper
Year of Publication2017
AuthorsSoria-Comas, Jordi, Domingo-Ferrer, Josep
Conference NameProceedings of the 12th International Conference on Availability, Reliability and Security
PublisherACM
Conference LocationNew York, NY, USA
ISBN Number978-1-4503-5257-4
Keywordsformal privacy, Measurement, Metrics, non-parametric methods, privacy, privacy models, pubcrawl, Synthetic Data, ε-synthetic privacy
Abstract

Generating synthetic data is a well-known option to limit disclosure risk in sensitive data releases. The usual approach is to build a model for the population and then generate a synthetic data set solely based on the model. We argue that building an accurate population model is difficult and we propose instead to approximate the original data as closely as privacy constraints permit. To enforce an ex ante privacy level when generating synthetic data, we introduce a new privacy model called $e$ synthetic privacy. Then, we describe a synthetic data generation method that satisfies $e$-synthetic privacy. Finally, we evaluate the utility of the synthetic data generated with our method.

URLhttps://dl.acm.org/citation.cfm?doid=3098954.3098962
DOI10.1145/3098954.3098962
Citation Keysoria-comas_non-parametric_2017