Visible to the public Randomized Bit Vector: Privacy-Preserving Encoding Mechanism

TitleRandomized Bit Vector: Privacy-Preserving Encoding Mechanism
Publication TypeConference Paper
Year of Publication2018
AuthorsSun, Lin, Zhang, Lan, Ye, Xiaojun
Conference NameProceedings of the 27th ACM International Conference on Information and Knowledge Management
PublisherACM
Conference LocationNew York, NY, USA
ISBN Number978-1-4503-6014-2
Keywordscompositionality, data anonymization, Data Sanitization, Differential privacy, Human Behavior, privacy, privacy-preserving record linkage, pubcrawl, pufferfish mechanism, resilience
AbstractRecently, many methods have been proposed to prevent privacy leakage in record linkage by encoding record pair data into another anonymous space. Nevertheless, they cannot perform well in some circumstances due to high computational complexities, low privacy guarantees or loss of data utility. In this paper, we propose distance-aware encoding mechanisms to compare numerical values in the anonymous space. We first embed numerical values into Hamming space by a low-computational encoding algorithm with randomized bit vector. To provide rigorous privacy guarantees, we use the random response based on differential privacy to keep global indistinguishability of original data and use Laplace noises via pufferfish mechanism to provide local indistinguishability. Besides, we provide an approach for embedding and privacy-related parameters selection to improve data utility. Experiments on datasets from different data distributions and application contexts validate that our approaches can be used efficiently in privacy-preserving record linkage tasks compared with previous works and have excellent performance even under very small privacy budgets.
URLhttp://doi.acm.org/10.1145/3269206.3271703
DOI10.1145/3269206.3271703
Citation Keysun_randomized_2018