Visible to the public Modified RNP Privacy Protection Data Mining Method as Big Data Security

TitleModified RNP Privacy Protection Data Mining Method as Big Data Security
Publication TypeConference Paper
Year of Publication2021
AuthorsYasa, Ray Novita, Buana, I Komang Setia, Girinoto, Setiawan, Hermawan, Hadiprakoso, Raden Budiarto
Conference Name2021 International Conference on Informatics, Multimedia, Cyber and Information System (ICIMCIS
KeywordsBig Data, big data security metrics, data mining, data privacy, data utility, Interference, Measurement, Multimedia systems, Perturbation methods, PPDM, privacy metrics, pubcrawl, resilience, Resiliency, RNP, Scalability, Support vector machines
AbstractPrivacy-Preserving Data Mining (PPDM) has become an exciting topic to discuss in recent decades due to the growing interest in big data and data mining. A technique of securing data but still preserving the privacy that is in it. This paper provides an alternative perturbation-based PPDM technique which is carried out by modifying the RNP algorithm. The novelty given in this paper are modifications of some steps method with a specific purpose. The modifications made are in the form of first narrowing the selection of the disturbance value. With the aim that the number of attributes that are replaced in each record line is only as many as the attributes in the original data, no more and no need to repeat; secondly, derive the perturbation function from the cumulative distribution function and use it to find the probability distribution function so that the selection of replacement data has a clear basis. The experiment results on twenty-five perturbed data show that the modified RNP algorithm balances data utility and security level by selecting the appropriate disturbance value and perturbation value. The level of security is measured using privacy metrics in the form of value difference, average transformation of data, and percentage of retains. The method presented in this paper is fascinating to be applied to actual data that requires privacy preservation.
DOI10.1109/ICIMCIS53775.2021.9699180
Citation Keyyasa_modified_2021