Visible to the public Robustness of Network Metrics in the Context of Digital Communication DataConflict Detection Enabled

TitleRobustness of Network Metrics in the Context of Digital Communication Data
Publication TypeConference Proceedings
Year of Publication2015
AuthorsJu-Sung Lee, Jurgen Pfeffer
Conference NameHICSS '15 Proceedings of the 2015 48th Hawaii International Conference on System Sciences
Date Published01/2015
PublisherIEEE Computer Society Washington, DC, USA ©2015
Conference LocationKauai, HI
ISBN978-1-4799-7367-5
KeywordsApr'15, CMU, digital communication, network analysis, sampling
Abstract

Social media data and other web-based network data are large and dynamic rendering the identification of structural changes in such systems a hard problem. Typically, online data is constantly streaming and results in data that is incomplete thus necessitating the need to understand the robustness of network metrics on partial or sampled network data. In this paper, we examine the effects of sampling on key network centrality metrics using two empirical communication datasets. Correlations between network metrics of original and sampled nodes offer a measure of sampling accuracy. The relationship between sampling and accuracy is convergent and amenable to nonlinear analysis. Naturally, larger edge samples induce sampled graphs that are more representative of the original graph. However, this effect is attenuated when larger sets of nodes are recovered in the samples. Also, we find that the graph structure plays a prominent role in sampling accuracy. Centralized graphs, in which fewer nodes enjoy higher centrality scores, offer more representative samples.

DOI10.1109/HICSS.2015.217
Citation Keynode-30184

Other available formats:

Lee_Robustness_JP.pdf
AttachmentTaxonomyKindSize
Lee_Robustness_JP.pdfPDF document1.07 MBDownloadPreview
AttachmentSize
bytes