Visible to the public Improved differential privacy K-means clustering algorithm for privacy budget allocation

TitleImproved differential privacy K-means clustering algorithm for privacy budget allocation
Publication TypeConference Paper
Year of Publication2022
AuthorsHan, Liquan, Xie, Yushan, Fan, Di, Liu, Jinyuan
Conference Name2022 International Conference on Computer Engineering and Artificial Intelligence (ICCEAI)
Date Publishedjul
KeywordsADPK-means clustering, Clustering algorithms, clustering centroids, clustering set separation, composability, dataset dissimilarity, Differential privacy, Heuristic algorithms, Human Behavior, Perturbation methods, privacy, privacy budget allocation, pubcrawl, resilience, Resiliency, Resource management, Scalability, usability
AbstractIn the differential privacy clustering algorithm, the added random noise causes the clustering centroids to be shifted, which affects the usability of the clustering results. To address this problem, we design a differential privacy K-means clustering algorithm based on an adaptive allocation of privacy budget to the clustering effect: Adaptive Differential Privacy K-means (ADPK-means). The method is based on the evaluation results generated at the end of each iteration in the clustering algorithm. First, it dynamically evaluates the effect of the clustered sets at the end of each iteration by measuring the separation and tightness between the clustered sets. Then, the evaluation results are introduced into the process of privacy budget allocation by weighting the traditional privacy budget allocation. Finally, different privacy budgets are assigned to different sets of clusters in the iteration to achieve the purpose of adaptively adding perturbation noise to each set. In this paper, both theoretical and experimental results are analyzed, and the results show that the algorithm satisfies e-differential privacy and achieves better results in terms of the availability of clustering results for the three standard datasets.
DOI10.1109/ICCEAI55464.2022.00054
Citation Keyhan_improved_2022