Visible to the public SciTokens: Capability-Based Secure Access to Remote Scientific Data

TitleSciTokens: Capability-Based Secure Access to Remote Scientific Data
Publication TypeConference Paper
Year of Publication2018
AuthorsWithers, Alex, Bockelman, Brian, Weitzel, Derek, Brown, Duncan, Gaynor, Jeff, Basney, Jim, Tannenbaum, Todd, Miller, Zach
Conference NameProceedings of the Practice and Experience on Advanced Research Computing
PublisherACM
Conference LocationNew York, NY, USA
ISBN Number978-1-4503-6446-1
Keywordscapabilities, compositionality, distributed computing, oAuth, Predictive Metrics, pubcrawl, Resiliency, Scientific Computing Security
AbstractThe management of security credentials (e.g., passwords, secret keys) for computational science workflows is a burden for scientists and information security officers. Problems with credentials (e.g., expiration, privilege mismatch) cause workflows to fail to fetch needed input data or store valuable scientific results, distracting scientists from their research by requiring them to diagnose the problems, re-run their computations, and wait longer for their results. In this paper, we introduce SciTokens, open source software to help scientists manage their security credentials more reliably and securely. We describe the SciTokens system architecture, design, and implementation addressing use cases from the Laser Interferometer Gravitational-Wave Observatory (LIGO) Scientific Collaboration and the Large Synoptic Survey Telescope (LSST) projects. We also present our integration with widely-used software that supports distributed scientific computing, including HTCondor, CVMFS, and XrootD. SciTokens uses IETF-standard OAuth tokens for capability-based secure access to remote scientific data. The access tokens convey the specific authorizations needed by the workflows, rather than general-purpose authentication impersonation credentials, to address the risks of scientific workflows running on distributed infrastructure including NSF resources (e.g., LIGO Data Grid, Open Science Grid, XSEDE) and public clouds (e.g., Amazon Web Services, Google Cloud, Microsoft Azure). By improving the interoperability and security of scientific workflows, SciTokens 1) enables use of distributed computing for scientific domains that require greater data protection and 2) enables use of more widely distributed computing resources by reducing the risk of credential abuse on remote systems.
URLhttp://doi.acm.org/10.1145/3219104.3219135
DOI10.1145/3219104.3219135
Citation Keywithers_scitokens:_2018