Visible to the public Compressing Graphs and Indexes with Recursive Graph Bisection

TitleCompressing Graphs and Indexes with Recursive Graph Bisection
Publication TypeConference Paper
Year of Publication2016
AuthorsDhulipala, Laxman, Kabiljo, Igor, Karrer, Brian, Ottaviano, Giuseppe, Pupyrev, Sergey, Shalita, Alon
Conference NameProceedings of the 22Nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
PublisherACM
Conference LocationNew York, NY, USA
ISBN Number978-1-4503-4232-2
KeywordsAlgorithms, Approximation algorithms, Compression, data mining, distributed algorithms, experimentation, graph algorithms, graph theory, Human Behavior, linear arrangement, malware analysis, Metrics, privacy, pubcrawl, Resiliency, social networks
Abstract

Graph reordering is a powerful technique to increase the locality of the representations of graphs, which can be helpful in several applications. We study how the technique can be used to improve compression of graphs and inverted indexes. We extend the recent theoretical model of Chierichetti et al. (KDD 2009) for graph compression, and show how it can be employed for compression-friendly reordering of social networks and web graphs and for assigning document identifiers in inverted indexes. We design and implement a novel theoretically sound reordering algorithm that is based on recursive graph bisection. Our experiments show a significant improvement of the compression rate of graph and indexes over existing heuristics. The new method is relatively simple and allows efficient parallel and distributed implementations, which is demonstrated on graphs with billions of vertices and hundreds of billions of edges.

URLhttp://doi.acm.org/10.1145/2939672.2939862
DOI10.1145/2939672.2939862
Citation Keydhulipala_compressing_2016