Algorithm for k-anonymity based on ball-tree and projection area density partition
Title | Algorithm for k-anonymity based on ball-tree and projection area density partition |
Publication Type | Conference Paper |
Year of Publication | 2019 |
Authors | Cheng, Chen, Xiaoli, Liu, Linfeng, Wei, Longxin, Lin, Xiaofeng, Wu |
Conference Name | 2019 14th International Conference on Computer Science Education (ICCSE) |
Date Published | Aug. 2019 |
Publisher | IEEE |
ISBN Number | 978-1-7281-1846-8 |
Keywords | Adult dataset, anonymity, ball-tree, Classification algorithms, composability, data privacy, Gotrack dataset, Human Behavior, individual privacy protection, information science, k-anonymity, k-anonymity algorithm, kd-tree, Loss measurement, Metrics, microdata publishing, Partitioning algorithms, privacy, projection area density partition, pubcrawl, resilience, Resiliency, tree data structures, trees (mathematics), UCI, usability |
Abstract | K-anonymity is a popular model used in microdata publishing to protect individual privacy. This paper introduces the idea of ball tree and projection area density partition into k-anonymity algorithm.The traditional kd-tree implements the division by forming a super-rectangular, but the super-rectangular has the area angle, so it cannot guarantee that the records on the corner are most similar to the records in this area. In this paper, the super-sphere formed by the ball-tree is used to address this problem. We adopt projection area density partition to increase the density of the resulting recorded points. We implement our algorithm with the Gotrack dataset and the Adult dataset in UCI. The experimentation shows that the k-anonymity algorithm based on ball-tree and projection area density partition, obtains more anonymous groups, and the generalization rate is lower. The smaller the K is, the more obvious the result advantage is. The result indicates that our algorithm can make data usability even higher. |
URL | https://ieeexplore.ieee.org/document/8845384 |
DOI | 10.1109/ICCSE.2019.8845384 |
Citation Key | cheng_algorithm_2019 |
- Loss measurement
- usability
- UCI
- trees (mathematics)
- tree data structures
- Resiliency
- resilience
- pubcrawl
- projection area density partition
- privacy
- Partitioning algorithms
- microdata publishing
- Metrics
- Adult dataset
- kd-tree
- k-anonymity algorithm
- k-anonymity
- information science
- individual privacy protection
- Human behavior
- Gotrack dataset
- data privacy
- composability
- Classification algorithms
- ball-tree
- anonymity