Differential Inference Testing: A Practical Approach to Evaluate Sanitizations of Datasets
Title | Differential Inference Testing: A Practical Approach to Evaluate Sanitizations of Datasets |
Publication Type | Conference Paper |
Year of Publication | 2019 |
Authors | Kassem, Ali, Ács, Gergely, Castelluccia, Claude, Palamidessi, Catuscia |
Conference Name | 2019 IEEE Security and Privacy Workshops (SPW) |
Date Published | May 2019 |
Publisher | IEEE |
ISBN Number | 978-1-7281-3508-3 |
Keywords | compositionality, data deletion, data privacy, Data Sanitization, differential inference testing, Differential privacy, Human Behavior, human factors, inference mechanisms, Inferences, k-anonymity, l-Diversity, machine learning, personal information, Predictive models, privacy, privacy models, pubcrawl, resilience, Resiliency, sanitization, sanitization techniques, sanitized data, Scalability, sensitive attribute, sensitive information, Sociology, Testing |
Abstract | In order to protect individuals' privacy, data have to be "well-sanitized" before sharing them, i.e. one has to remove any personal information before sharing data. However, it is not always clear when data shall be deemed well-sanitized. In this paper, we argue that the evaluation of sanitized data should be based on whether the data allows the inference of sensitive information that is specific to an individual, instead of being centered around the concept of re-identification. We propose a framework to evaluate the effectiveness of different sanitization techniques on a given dataset by measuring how much an individual's record from the sanitized dataset influences the inference of his/her own sensitive attribute. Our intent is not to accurately predict any sensitive attribute but rather to measure the impact of a single record on the inference of sensitive information. We demonstrate our approach by sanitizing two real datasets in different privacy models and evaluate/compare each sanitized dataset in our framework. |
URL | https://ieeexplore.ieee.org/document/8844631 |
DOI | 10.1109/SPW.2019.00024 |
Citation Key | kassem_differential_2019 |
- Predictive models
- testing
- Sociology
- sensitive information
- sensitive attribute
- Scalability
- sanitized data
- sanitization techniques
- sanitization
- Resiliency
- resilience
- pubcrawl
- privacy models
- privacy
- Compositionality
- personal information
- machine learning
- l-Diversity
- k-anonymity
- Inferences
- inference mechanisms
- Human Factors
- Human behavior
- differential privacy
- differential inference testing
- Data Sanitization
- data privacy
- data deletion