An Architecture for Privacy-Preserving and Replicable High-Recall Retrieval Experiments

Title: An Architecture for Privacy-Preserving and Replicable High-Recall Retrieval Experiments
Publication Type: Conference Paper
Year of Publication: 2016
Authors: Roegiest, Adam; Cormack, Gordon V.
Conference Name: Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval
Conference Location: New York, NY, USA
ISBN Number: 978-1-4503-4069-4
Keywords: experimentation, high recall, privacy, pubcrawl, rest, security, virtual machine, virtual machine security, virtualization privacy

We demonstrate the infrastructure used in the TREC 2015 Total Recall track to facilitate controlled simulation of "assessor in the loop" high-recall retrieval experimentation. The implementation and corresponding design decisions are presented for this platform. This includes the necessary considerations to ensure that experiments are privacy-preserving when using test collections that cannot be distributed. Furthermore, we describe the use of virtual machines as a means of system submission in order to promote replicable experiments while also ensuring the security of system developers and data providers.

Citation Key: roegiest_architecture_2016