VACCINE: Using Contextual Integrity For Data Leakage Detection
Title | VACCINE: Using Contextual Integrity For Data Leakage Detection |
Publication Type | Conference Paper |
Year of Publication | 2019 |
Authors | Yan Shvartzshnaider, Zvonimir Pavlinovic, Ananth Balashankar, Thomas Wies, Lakshminarayanan Subramanian, Helen Nissenbaum, Prateek Mittal |
Conference Name | The World Wide Web Conference |
Date Published | 05/2019 |
Publisher | ACM |
Conference Location | San Francisco, CA, USA |
ISBN Number | 978-1-4503-6674-8 |
Keywords | 2019: July, Contextual Integrity, Contextual Integrity for Computer Systems, Data Leakage Detection, DLP, ICSI, Policy-Governed Secure Collaboration, privacy, Scalability and Composability |
Abstract | Modern enterprises rely on Data Leakage Prevention (DLP) systems to enforce privacy policies that prevent unintentional flow of sensitive information to unauthorized entities. However, these systems operate based on rule sets that are limited to syntactic analysis and therefore completely ignore the semantic relationships between participants involved in the information exchanges. For similar reasons, these systems cannot enforce complex privacy policies that require temporal reasoning about events that have previously occurred. To address these limitations, we advocate a new design methodology for DLP systems centered on the notion of Contextual Integrity (CI). We use the CI framework to abstract real-world communication exchanges into formally defined information flows where privacy policies describe sequences of admissible flows. CI allows us to decouple (1) the syntactic extraction of flows from information exchanges, and (2) the enforcement of privacy policies on these flows. We applied this approach to built VACCINE, a DLP auditing system for emails. VACCINE uses state-of-the-art techniques in natural language processing to extract flows from email text. It also provides a declarative language for describing privacy policies. These policies are automatically compiled to operational rules that the system uses for detecting data leakages. We evaluated VACCINE on the Enron email corpus and show that it improves over the state of the art both in terms of the expressivity of the policies that DLP systems can enforce as well as its precision in detecting data leakages. |
URL | http://doi.acm.org/10.1145/3308558.3313655 |
DOI | 10.1145/3308558.3313655 |
Citation Key | Shvartzshnaider:2019:VUC:3308558.3313655 |
Refereed Designation | Refereed |