Visible to the public Differentially Private Online Active Learning with Applications to Anomaly Detection

TitleDifferentially Private Online Active Learning with Applications to Anomaly Detection
Publication TypeConference Paper
Year of Publication2016
AuthorsGhassemi, Mohsen, Sarwate, Anand D., Wright, Rebecca N.
Conference NameProceedings of the 2016 ACM Workshop on Artificial Intelligence and Security
Date PublishedOctober 2016
PublisherACM
Conference LocationNew York, NY, USA
ISBN Number978-1-4503-4573-6
Keywordsactive learning, anomaly detection, composability, compositionality, Computational Intelligence, cryptography, Differential privacy, online learning, pubcrawl, Security Heuristics, stochastic gradient descent
Abstract

In settings where data instances are generated sequentially or in streaming fashion, online learning algorithms can learn predictors using incremental training algorithms such as stochastic gradient descent. In some security applications such as training anomaly detectors, the data streams may consist of private information or transactions and the output of the learning algorithms may reveal information about the training data. Differential privacy is a framework for quantifying the privacy risk in such settings. This paper proposes two differentially private strategies to mitigate privacy risk when training a classifier for anomaly detection in an online setting. The first is to use a randomized active learning heuristic to screen out uninformative data points in the stream. The second is to use mini-batching to improve classifier performance. Experimental results show how these two strategies can trade off privacy, label complexity, and generalization performance.

URLhttps://dl.acm.org/doi/10.1145/2996758.2996766
DOI10.1145/2996758.2996766
Citation Keyghassemi_differentially_2016