Parallel Algorithms for Constructing Range and Nearest-Neighbor Searching Data Structures
Title | Parallel Algorithms for Constructing Range and Nearest-Neighbor Searching Data Structures |
Publication Type | Conference Paper |
Year of Publication | 2016 |
Authors | Agarwal, Pankaj K., Fox, Kyle, Munagala, Kamesh, Nath, Abhinandan |
Conference Name | Proceedings of the 35th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems |
Publisher | ACM |
Conference Location | New York, NY, USA |
ISBN Number | 978-1-4503-4191-2 |
Keywords | computational geometry, data structures, MapReduce, Metrics, nearest neighbor search, nearest-neighbor search, pubcrawl, range search, sampling |
Abstract | With the massive amounts of data available today, it is common to store and process data using multiple machines. Parallel programming platforms such as MapReduce and its variants are popular frameworks for handling such large data. We present the first provably efficient algorithms to compute, store, and query data structures for range queries and approximate nearest neighbor queries in a popular parallel computing abstraction that captures the salient features of MapReduce and other massively parallel communication (MPC) models. In particular, we describe algorithms for \$kd\$-trees, range trees, and BBD-trees that only require O(1) rounds of communication for both preprocessing and querying while staying competitive in terms of running time and workload to their classical counterparts. Our algorithms are randomized, but they can be made deterministic at some increase in their running time and workload while keeping the number of rounds of communication to be constant. |
URL | http://doi.acm.org/10.1145/2902251.2902303 |
DOI | 10.1145/2902251.2902303 |
Citation Key | agarwal_parallel_2016 |