Title | Romulus: Efficient Algorithms for Persistent Transactional Memory |
Publication Type | Conference Paper |
Year of Publication | 2018 |
Authors | Correia, Andreia, Felber, Pascal, Ramalhete, Pedro |
Conference Name | Proceedings of the 30th on Symposium on Parallelism in Algorithms and Architectures |
Publisher | ACM |
Conference Location | New York, NY, USA |
ISBN Number | 978-1-4503-5799-9 |
Keywords | Computing Theory and Resilience, Concurrency, crash resilience, failure atomicity, persistent memory, pubcrawl, Resiliency, transactions |
Abstract | Byte addressable persistent memory eliminates the need for serialization and deserialization of data, to and from persistent storage, allowing applications to interact with it through common store and load instructions. In the event of a process or system failure, applications rely on persistent techniques to provide consistent storage of data in non-volatile memory (NVM). For most of these techniques, consistency is ensured through logging of updates, with consequent intensive cache line flushing and persistent fences necessary to guarantee correctness. Undo log based approaches require store interposition and persistence fences before each in-place modification. Redo log based techniques can execute transactions using just two persistence fences, although they require store and load interposition which may incur a performance penalty for large transactions. So far, these techniques have been difficult to integrate with known memory allocators, requiring allocators or garbage collectors specifically designed for NVM. We present Romulus, a user-level library persistent transactional memory (PTM) which provides durable transactions through the usage of twin copies of the data. A transaction in Romulus requires at most four persistence fences, regardless of the transaction size. Romulus uses only store interposition. Any sequential implementation of a memory allocator can be adapted to work with Romulus. Thanks to its lightweight design and low synchronization overhead, Romulus achieves twice the throughput of current state of the art PTMs in update-only workloads, and more than one order of magnitude in read-mostly scenarios. |
URL | http://doi.acm.org/10.1145/3210377.3210392 |
DOI | 10.1145/3210377.3210392 |
Citation Key | correia_romulus:_2018 |