Towards Embedding Data Provenance in Files

Title: Towards Embedding Data Provenance in Files
Publication Type: Conference Paper
Year of Publication: 2021
Authors: Phua, Thye Way; Patros, Panos; Kumar, Vimal
Conference Name: 2021 IEEE 11th Annual Computing and Communication Workshop and Conference (CCWC)
Keywords: composability, Conferences, data provenance, Delta-encoding, Embedded Data Provenance, End-to-end Data Provenance, file system, Forensics, History, Human Behavior, Intrusion detection, Metrics, Prototypes, Provenance, pubcrawl, Resiliency, Self-contained Provenance, Stakeholders, usability
Abstract: Data provenance (keeping track of who did what, where, when and how) offers various attractive use cases for distributed systems, such as intrusion detection, forensic analysis and secure information dependability. This potential, however, can only be realized if provenance is accessible to its primary stakeholders: the end-users. Existing provenance systems are designed in an "all-or-nothing" fashion, making provenance inaccessible, difficult to extract and, crucially, not controlled by its key stakeholders. To mitigate this, we propose that provenance be separated into system, data-specific and file-metadata provenance. Furthermore, we expand data-specific provenance into changes at a fine-grained level, or provenance-per-change, recorded alongside its source. We show that, with the use of delta-encoding, provenance-per-change is viable, indicating that our proposed architecture is effectively realizable.
DOI: 10.1109/CCWC51732.2021.9375947
Citation Key: phua_towards_2021
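
Illustrative sketch (not from the paper): the abstract describes provenance-per-change stored alongside its source and kept compact with delta-encoding. The Python below is one minimal way to picture that idea, assuming a text document, a character-level delta built with the standard-library difflib module, and a simple who/when record per change. The names make_delta, apply_delta, Change and ProvenancedText are hypothetical and chosen only for this example; the paper's actual prototype and file format may differ.

```python
import difflib
from dataclasses import dataclass, field


def make_delta(old, new):
    """Encode `new` relative to `old` as copy/add operations (hypothetical format)."""
    ops = []
    matcher = difflib.SequenceMatcher(None, old, new, autojunk=False)
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            # Unchanged span: store only offsets into the old text.
            ops.append(("copy", i1, i2))
        elif tag in ("replace", "insert"):
            # New material: store the literal characters.
            ops.append(("add", new[j1:j2]))
        # "delete" needs no operation: the old span is simply not copied.
    return ops


def apply_delta(old, delta):
    """Rebuild the newer text from the older text and a delta."""
    parts = []
    for op in delta:
        if op[0] == "copy":
            parts.append(old[op[1]:op[2]])
        else:
            parts.append(op[1])
    return "".join(parts)


@dataclass
class Change:
    who: str      # assumed provenance field: author of the change
    when: str     # assumed provenance field: timestamp of the change
    delta: list   # delta against the previous version


@dataclass
class ProvenancedText:
    base: str                                   # original content
    changes: list = field(default_factory=list)  # provenance kept with the data

    def current(self):
        # Replay every recorded delta, oldest first, on top of the base.
        text = self.base
        for change in self.changes:
            text = apply_delta(text, change.delta)
        return text

    def record(self, new_text, who, when):
        # Store only the difference from the current version, plus who/when.
        delta = make_delta(self.current(), new_text)
        self.changes.append(Change(who, when, delta))
```

In this sketch, calling record() for each edit stores only the difference from the previous version together with its author and timestamp, and current() replays the deltas to recover the latest content. Because the change list travels with the base text, the history stays self-contained in one object, which is the property the abstract calls embedded or self-contained provenance.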