Analysis of Checkpointing Overhead in Parallel State Machine Replication
Title | Analysis of Checkpointing Overhead in Parallel State Machine Replication |
Publication Type | Conference Paper |
Year of Publication | 2016 |
Authors | Mendizabal, Odorico M.; Dotti, Fernando Luís; Pedone, Fernando |
Conference Name | Proceedings of the 31st Annual ACM Symposium on Applied Computing |
Publisher | ACM |
Conference Location | New York, NY, USA |
ISBN Number | 978-1-4503-3739-7 |
Keywords | checkpointing, Distributed Systems, Fault tolerance, pubcrawl, Resiliency, System recovery |
Abstract | State machine replication (SMR) is a well-established technique for building fault-tolerant systems. In part, this is explained by the simplicity of the approach and its strong consistency guarantees. Recently, several proposals have suggested parallelizing the execution of state machine replicas to achieve high throughput. Concurrent execution of commands has many implications, including for the recovery of replicas from failures. Conventional checkpointing techniques, for example, must be revisited in parallelized models. In this paper, we review parallel variations of state machine replication and discuss how checkpointing procedures apply to these models. Moreover, we use simulations to evaluate the impact of checkpointing techniques on recovery. |
URL | http://doi.acm.org/10.1145/2851613.2851879 |
DOI | 10.1145/2851613.2851879 |
Citation Key | mendizabal_analysis_2016 |