SuperMan: A Novel System for Storing and Retrieving Scientific-Simulation Provenance for Efficient Job Executions on Computing Clusters
Title | SuperMan: A Novel System for Storing and Retrieving Scientific-Simulation Provenance for Efficient Job Executions on Computing Clusters |
Publication Type | Conference Paper |
Year of Publication | 2017 |
Authors | Suh, Y. K., Ma, J. |
Conference Name | 2017 IEEE 2nd International Workshops on Foundations and Applications of Self* Systems (FAS*W) |
Date Published | sep |
Keywords | composability, compositionality, Computational modeling, compute-intensive simulations, computing clusters, Conferences, Data models, digital simulation, EDISON, HPC, Human Behavior, human factors, information retrieval, interoperability, job executions, Metrics, open systems, PROV, Provenance, pubcrawl, Recycling, Resiliency, Scientific computing, scientific information systems, scientific-simulation provenance retrieval, scientific-simulation provenance storage, simulation, SimUlation ProvEnance Recycling MANager, storage management, SuperMan |
Abstract | Compute-intensive simulations typically charge substantial workloads on an online simulation platform backed by limited computing clusters and storage resources. Some (or most) of the simulations initiated by users may accompany input parameters/files that have been already provided by other (or same) users in the past. Unfortunately, these duplicate simulations may aggravate the performance of the platform by drastic consumption of the limited resources shared by a number of users on the platform. To minimize or avoid conducting repeated simulations, we present a novel system, called SUPERMAN (SimUlation ProvEnance Recycling MANager) that can record simulation provenances and recycle the results of past simulations. This system presents a great opportunity to not only reutilize existing results but also perform various analytics helpful for those who are not familiar with the platform. The system also offers interoperability across other systems by collecting the provenances in a standardized format. In our simulated experiments we found that over half of past computing jobs could be answered without actual executions by our system. |
URL | http://ieeexplore.ieee.org/document/8064136/ |
DOI | 10.1109/FAS-W.2017.160 |
Citation Key | suh_superman:_2017 |
- Metrics
- SuperMan
- storage management
- SimUlation ProvEnance Recycling MANager
- simulation
- scientific-simulation provenance storage
- scientific-simulation provenance retrieval
- scientific information systems
- scientific computing
- Resiliency
- Recycling
- pubcrawl
- Provenance
- PROV
- open systems
- composability
- job executions
- interoperability
- information retrieval
- Human Factors
- Human behavior
- HPC
- EDISON
- digital simulation
- Data models
- Conferences
- computing clusters
- compute-intensive simulations
- Computational modeling
- Compositionality