Visible to the public Romulus: Efficient Algorithms for Persistent Transactional Memory

TitleRomulus: Efficient Algorithms for Persistent Transactional Memory
Publication TypeConference Paper
Year of Publication2018
AuthorsCorreia, Andreia, Felber, Pascal, Ramalhete, Pedro
Conference NameProceedings of the 30th on Symposium on Parallelism in Algorithms and Architectures
PublisherACM
Conference LocationNew York, NY, USA
ISBN Number978-1-4503-5799-9
KeywordsComputing Theory and Resilience, Concurrency, crash resilience, failure atomicity, persistent memory, pubcrawl, Resiliency, transactions
AbstractByte addressable persistent memory eliminates the need for serialization and deserialization of data, to and from persistent storage, allowing applications to interact with it through common store and load instructions. In the event of a process or system failure, applications rely on persistent techniques to provide consistent storage of data in non-volatile memory (NVM). For most of these techniques, consistency is ensured through logging of updates, with consequent intensive cache line flushing and persistent fences necessary to guarantee correctness. Undo log based approaches require store interposition and persistence fences before each in-place modification. Redo log based techniques can execute transactions using just two persistence fences, although they require store and load interposition which may incur a performance penalty for large transactions. So far, these techniques have been difficult to integrate with known memory allocators, requiring allocators or garbage collectors specifically designed for NVM. We present Romulus, a user-level library persistent transactional memory (PTM) which provides durable transactions through the usage of twin copies of the data. A transaction in Romulus requires at most four persistence fences, regardless of the transaction size. Romulus uses only store interposition. Any sequential implementation of a memory allocator can be adapted to work with Romulus. Thanks to its lightweight design and low synchronization overhead, Romulus achieves twice the throughput of current state of the art PTMs in update-only workloads, and more than one order of magnitude in read-mostly scenarios.
URLhttp://doi.acm.org/10.1145/3210377.3210392
DOI10.1145/3210377.3210392
Citation Keycorreia_romulus:_2018