Visible to the public Information Entropy Differential Privacy: A Differential Privacy Protection Data Method Based on Rough Set Theory

TitleInformation Entropy Differential Privacy: A Differential Privacy Protection Data Method Based on Rough Set Theory
Publication TypeConference Paper
Year of Publication2019
AuthorsLi, Xianxian, Luo, Chunfeng, Liu, Peng, Wang, Li-e
Conference Name2019 IEEE Intl Conf on Dependable, Autonomic and Secure Computing, Intl Conf on Pervasive Intelligence and Computing, Intl Conf on Cloud and Big Data Computing, Intl Conf on Cyber Science and Technology Congress (DASC/PiCom/CBDCom/CyberSciTech)
Keywordsassociated data, Association data, behavioral prediction, composability, Computing Theory and Privacy, Correlation, correlation data privacy issues, data correlations, data privacy, Differential privacy, differential privacy protection data method, Entropy, Human Behavior, information entropy, information entropy difference privacy, information entropy differential privacy solution, personal sensitive information, personalized noise, privacy, privacy protection, pubcrawl, random noise, Resiliency, rough set theory, Rough sets, Scalability, Sensitivity, use information entropy
Abstract

Data have become an important asset for analysis and behavioral prediction, especially correlations between data. Privacy protection has aroused academic and social concern given the amount of personal sensitive information involved in data. However, existing works assume that the records are independent of each other, which is unsuitable for associated data. Many studies either fail to achieve privacy protection or lead to excessive loss of information while applying data correlations. Differential privacy, which achieves privacy protection by injecting random noise into the statistical results given the correlation, will improve the background knowledge of adversaries. Therefore, this paper proposes an information entropy differential privacy solution for correlation data privacy issues based on rough set theory. Under the solution, we use rough set theory to measure the degree of association between attributes and use information entropy to quantify the sensitivity of the attribute. The information entropy difference privacy is achieved by clustering based on the correlation and adding personalized noise to each cluster while preserving the correlations between data. Experiments show that our algorithm can effectively preserve the correlation between the attributes while protecting privacy.

DOI10.1109/DASC/PiCom/CBDCom/CyberSciTech.2019.00169
Citation Keyli_information_2019