Differentially Private Online Active Learning with Applications to Anomaly Detection
Title | Differentially Private Online Active Learning with Applications to Anomaly Detection |
Publication Type | Conference Paper |
Year of Publication | 2016 |
Authors | Ghassemi, Mohsen, Sarwate, Anand D., Wright, Rebecca N. |
Conference Name | Proceedings of the 2016 ACM Workshop on Artificial Intelligence and Security |
Publisher | ACM |
Conference Location | New York, NY, USA |
ISBN Number | 978-1-4503-4573-6 |
Keywords | active learning, anomaly detection, control theory, Differential privacy, online learning, privacy, pubcrawl, Resiliency, stochastic gradient descent |
Abstract | In settings where data instances are generated sequentially or in streaming fashion, online learning algorithms can learn predictors using incremental training algorithms such as stochastic gradient descent. In some security applications such as training anomaly detectors, the data streams may consist of private information or transactions and the output of the learning algorithms may reveal information about the training data. Differential privacy is a framework for quantifying the privacy risk in such settings. This paper proposes two differentially private strategies to mitigate privacy risk when training a classifier for anomaly detection in an online setting. The first is to use a randomized active learning heuristic to screen out uninformative data points in the stream. The second is to use mini-batching to improve classifier performance. Experimental results show how these two strategies can trade off privacy, label complexity, and generalization performance. |
URL | http://doi.acm.org/10.1145/2996758.2996766 |
DOI | 10.1145/2996758.2996766 |
Citation Key | ghassemi_differentially_2016 |