Title | Unsupervised Discovery Of Semantically Aware Communities With Tensor Kruskal Decomposition: A Case Study In Twitter |
Publication Type | Conference Paper |
Year of Publication | 2020 |
Authors | Drakopoulos, G., Giotopoulos, K., Giannoukou, I., Sioutas, S. |
Conference Name | 2020 15th International Workshop on Semantic and Social Media Adaptation and Personalization (SMA |
Date Published | oct |
Keywords | analytical methodologies, Blogs, clustered graph spectrum, Community discovery, community structure discovery, compact community size distribution, compositionality, cyber physical systems, decomposition, digital marketing, Facebook groups, fact checking, followfollower relationships, functional coherency, functional Twitter elements, graph theory, hashtag semantics, heavy degree distribution tail, high clustering coefficient values, higher order analytics, higher order counterpart, Julia, Kruskal decomposition, low diameter, Metrics, network theory (graphs), news aggregators, Periodic structures, Political Campaigns, politics, pubcrawl, rank one tensors, recursive community structure, semantic weight, semantically aware communities, Semantics, singular value decomposition, social network, social networking (online), Structural coherency, structural Twitter elements, substantial empirical evidence, SVD, synthetic graph generation models, tensor algebra, tensor kruskal decomposition, tensor representation, tensors, Tucker decomposition, Tucker tensor decomposition, Twitter, Twitter subgraph, unofficial form, unsupervised discovery, unsupervised learning |
Abstract | Substantial empirical evidence, including the success of synthetic graph generation models as well as of analytical methodologies, suggests that large, real graphs have a recursive community structure. The latter results, in part at least, in other important properties of these graphs such as low diameter, high clustering coefficient values, heavy degree distribution tail, and clustered graph spectrum. Notice that this structure need not be official or moderated like Facebook groups, but it can also take an ad hoc and unofficial form depending on the functionality of the social network under study as for instance the follow relationship on Twitter or the connections between news aggregators on Reddit. Community discovery is paramount in numerous applications such as political campaigns, digital marketing, crowdfunding, and fact checking. Here a tensor representation for Twitter subgraphs is proposed which takes into consideration both the followfollower relationships but also the coherency in hashtags. Community structure discovery then reduces to the computation of Tucker tensor decomposition, a higher order counterpart of the well-known unsupervised learning method of singular value decomposition (SVD). Tucker decomposition clearly outperforms the SVD in terms of finding a more compact community size distribution in experiments done in Julia on a Twitter subgraph. This can be attributed to the facts that the proposed methodology combines both structural and functional Twitter elements and that hashtags carry an increased semantic weight in comparison to ordinary tweets. |
DOI | 10.1109/SMAP49528.2020.9248469 |
Citation Key | drakopoulos_unsupervised_2020 |