Measuring Privacy in High Dimensional Microdata Collections
Title | Measuring Privacy in High Dimensional Microdata Collections |
Publication Type | Conference Paper |
Year of Publication | 2017 |
Authors | Boukoros, Spyros, Katzenbeisser, Stefan |
Conference Name | Proceedings of the 12th International Conference on Availability, Reliability and Security |
Publisher | ACM |
Conference Location | New York, NY, USA |
ISBN Number | 978-1-4503-5257-4 |
Keywords | Human Behavior, human factors, Metrics, microdata, privacy, privacy metrics, pubcrawl, Resiliency, Scalability, Security Risk Estimation, user empowerment |
Abstract | Microdata is collected by companies in order to enhance their quality of service as well as the accuracy of their recommendation systems. These data often become publicly available after they have been sanitized. Recent reidentification attacks on publicly available, sanitized datasets illustrate the privacy risks involved in microdata collections. Currently, users have to trust the provider that their data will be safe in case data is published or if a privacy breach occurs. In this work, we empower users by developing a novel, user-centric tool for privacy measurement and a new lightweight privacy metric. The goal of our tool is to estimate users' privacy level prior to sharing their data with a provider. Hence, users can consciously decide whether to contribute their data. Our tool estimates an individuals' privacy level based on published popularity statistics regarding the items in the provider's database, and the users' microdata. In this work, we describe the architecture of our tool as well as a novel privacy metric, which is necessary for our setting where we do not have access to the provider's database. Our tool is user friendly, relying on smart visual results that raise privacy awareness. We evaluate our tool using three real world datasets, collected from major providers. We demonstrate strong correlations between the average anonymity set per user and the privacy score obtained by our metric. Our results illustrate that our tool which uses minimal information from the provider, estimates users' privacy levels comparably well, as if it had access to the actual database. |
URL | https://dl.acm.org/citation.cfm?doid=3098954.3098977 |
DOI | 10.1145/3098954.3098977 |
Citation Key | boukoros_measuring_2017 |