Title | Towards Embedding Data Provenance in Files |
Publication Type | Conference Paper |
Year of Publication | 2021 |
Authors | Phua, Thye Way; Patros, Panos; Kumar, Vimal |
Conference Name | 2021 IEEE 11th Annual Computing and Communication Workshop and Conference (CCWC) |
Keywords | composability, Conferences, data provenance, Delta-encoding, Embedded Data Provenance, End-to-end Data Provenance, file system, Forensics, History, Human Behavior, Intrusion detection, Metrics, Prototypes, Provenance, pubcrawl, Resiliency, Self-contained Provenance, Stakeholders, usability |
Abstract | Data provenance (keeping track of who did what, where, when and how) offers various attractive use cases for distributed systems, such as intrusion detection, forensic analysis and secure information dependability. This potential, however, can only be realized if provenance is accessible to its primary stakeholders: the end-users. Existing provenance systems are designed in an 'all-or-nothing' fashion, making provenance inaccessible, difficult to extract and, crucially, not controlled by its key stakeholders. To mitigate this, we propose that provenance be separated into system, data-specific and file-metadata provenance. Furthermore, we expand data-specific provenance as changes at a fine-grained level, or provenance-per-change, that is recorded alongside its source. We show that with the use of delta-encoding, provenance-per-change is viable, indicating that our proposed architecture is effectively realizable. |
DOI | 10.1109/CCWC51732.2021.9375947 |
Citation Key | phua_towards_2021 |
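
The abstract's idea of provenance-per-change recorded alongside its source can be illustrated with a minimal Python sketch. This is not the authors' implementation: the names `TrackedFile`, `ChangeRecord`, `make_delta` and `apply_delta` are hypothetical, the copy/insert delta format is an assumption, and the standard-library `difflib` stands in for whatever delta encoder the paper uses. Each write stores only a delta against the previous version, tagged with who made it and when, so the file carries its own history.

```python
import difflib
from dataclasses import dataclass, field
from datetime import datetime, timezone


def make_delta(old: str, new: str) -> list:
    """Encode `new` relative to `old` as a list of copy/insert operations."""
    ops = []
    matcher = difflib.SequenceMatcher(a=old, b=new, autojunk=False)
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            # Unchanged span: reference it in the old text instead of storing it.
            ops.append(("copy", i1, i2))
        elif tag in ("replace", "insert"):
            # New material: store the literal characters.
            ops.append(("insert", new[j1:j2]))
        # "delete" emits nothing: the removed span is simply never copied.
    return ops


def apply_delta(old: str, ops: list) -> str:
    """Rebuild the newer text from the older text and a delta."""
    parts = []
    for op in ops:
        if op[0] == "copy":
            parts.append(old[op[1]:op[2]])
        else:
            parts.append(op[1])
    return "".join(parts)


@dataclass
class ChangeRecord:
    """One provenance entry (hypothetical schema): who, when, and the delta."""
    who: str
    when: str
    ops: list


@dataclass
class TrackedFile:
    """A file whose content and per-change provenance travel together."""
    base: str
    history: list = field(default_factory=list)

    def version(self, n: int) -> str:
        """Replay the first n deltas on top of the base content."""
        text = self.base
        for record in self.history[:n]:
            text = apply_delta(text, record.ops)
        return text

    def current(self) -> str:
        return self.version(len(self.history))

    def write(self, who: str, new_text: str) -> None:
        """Record a change as a delta plus who made it and when."""
        ops = make_delta(self.current(), new_text)
        when = datetime.now(timezone.utc).isoformat()
        self.history.append(ChangeRecord(who, when, ops))


doc = TrackedFile("hello world")
doc.write("alice", "hello brave world")
doc.write("bob", "goodbye brave world")
for number, record in enumerate(doc.history, start=1):
    print(number, record.who, record.when, repr(doc.version(number)))
```

Storing deltas rather than full copies is what keeps a per-change history small enough to embed in the file; the trade-off in this sketch is that reading an old version requires replaying every delta up to it.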