Optimizing Hash Partitioning for Solid State Drives

Title: Optimizing Hash Partitioning for Solid State Drives
Publication Type: Conference Paper
Year of Publication: 2016
Authors: Shin, Mincheol; Roh, Hongchan; Jung, Wonmook; Park, Sanghyun
Conference Name: Proceedings of the 31st Annual ACM Symposium on Applied Computing
Publisher: ACM
Conference Location: New York, NY, USA
ISBN Number: 978-1-4503-3739-7
Keywords: Algorithm, composability, flash storage devices, hash algorithms, hash join, hash partitioning, internal parallelism of flashSSDs, Metrics, pubcrawl, query execution, Resiliency, Scalability
Abstract

The use of flashSSDs has increased rapidly in a wide range of areas due to their superior energy efficiency, shorter access time, and higher bandwidth compared to HDDs. The internal parallelism created by the multiple flash memory packages embedded in a flashSSD is one of the unique features of flashSSDs. Many new DBMS technologies have been developed for flashSSDs, but query processing for flashSSDs has drawn less attention than other DBMS technologies. Hash partitioning is widely used in query processing algorithms to materialize their intermediate results efficiently. In this paper, we propose a novel hash partitioning algorithm that exploits the internal parallelism of flashSSDs. The devised hash partitioning method outperforms the traditional hash partitioning technique regardless of the amount of available main memory and independently of the buffer management strategy (blocked I/O vs. page-sized I/O). We implemented our method based on the source code of the PostgreSQL storage manager. PostgreSQL relation files created by the TPC-H workload were employed in the experiments. Our method was found to be up to 3.55 times faster than the traditional method with blocked I/O, and 2.36 times faster than the traditional method with page-sized I/O.

URL: http://doi.acm.org/10.1145/2851613.2851663
DOI: 10.1145/2851613.2851663
Citation Key: shin_optimizing_2016