Differentially Private Online Active Learning with Applications to Anomaly Detection

Submitted by grigby1 on Mon, 07/24/2017 - 1:52pm

Title	Differentially Private Online Active Learning with Applications to Anomaly Detection
Publication Type	Conference Paper
Year of Publication	2016
Authors	Ghassemi, Mohsen, Sarwate, Anand D., Wright, Rebecca N.
Conference Name	Proceedings of the 2016 ACM Workshop on Artificial Intelligence and Security
Publisher	ACM
Conference Location	New York, NY, USA
ISBN Number	978-1-4503-4573-6
Keywords	active learning, anomaly detection, control theory, Differential privacy, online learning, privacy, pubcrawl, Resiliency, stochastic gradient descent
Abstract	In settings where data instances are generated sequentially or in streaming fashion, online learning algorithms can learn predictors using incremental training algorithms such as stochastic gradient descent. In some security applications such as training anomaly detectors, the data streams may consist of private information or transactions and the output of the learning algorithms may reveal information about the training data. Differential privacy is a framework for quantifying the privacy risk in such settings. This paper proposes two differentially private strategies to mitigate privacy risk when training a classifier for anomaly detection in an online setting. The first is to use a randomized active learning heuristic to screen out uninformative data points in the stream. The second is to use mini-batching to improve classifier performance. Experimental results show how these two strategies can trade off privacy, label complexity, and generalization performance.
URL	http://doi.acm.org/10.1145/2996758.2996766
DOI	10.1145/2996758.2996766
Citation Key	ghassemi_differentially_2016

Groups:

Science of Security VO