Visible to the public Unsupervised Discovery Of Semantically Aware Communities With Tensor Kruskal Decomposition: A Case Study In Twitter

TitleUnsupervised Discovery Of Semantically Aware Communities With Tensor Kruskal Decomposition: A Case Study In Twitter
Publication TypeConference Paper
Year of Publication2020
AuthorsDrakopoulos, G., Giotopoulos, K., Giannoukou, I., Sioutas, S.
Conference Name2020 15th International Workshop on Semantic and Social Media Adaptation and Personalization (SMA
Date Publishedoct
Keywordsanalytical methodologies, Blogs, clustered graph spectrum, Community discovery, community structure discovery, compact community size distribution, compositionality, cyber physical systems, decomposition, digital marketing, Facebook groups, fact checking, followfollower relationships, functional coherency, functional Twitter elements, graph theory, hashtag semantics, heavy degree distribution tail, high clustering coefficient values, higher order analytics, higher order counterpart, Julia, Kruskal decomposition, low diameter, Metrics, network theory (graphs), news aggregators, Periodic structures, Political Campaigns, politics, pubcrawl, rank one tensors, recursive community structure, semantic weight, semantically aware communities, Semantics, singular value decomposition, social network, social networking (online), Structural coherency, structural Twitter elements, substantial empirical evidence, SVD, synthetic graph generation models, tensor algebra, tensor kruskal decomposition, tensor representation, tensors, Tucker decomposition, Tucker tensor decomposition, Twitter, Twitter subgraph, unofficial form, unsupervised discovery, unsupervised learning
AbstractSubstantial empirical evidence, including the success of synthetic graph generation models as well as of analytical methodologies, suggests that large, real graphs have a recursive community structure. The latter results, in part at least, in other important properties of these graphs such as low diameter, high clustering coefficient values, heavy degree distribution tail, and clustered graph spectrum. Notice that this structure need not be official or moderated like Facebook groups, but it can also take an ad hoc and unofficial form depending on the functionality of the social network under study as for instance the follow relationship on Twitter or the connections between news aggregators on Reddit. Community discovery is paramount in numerous applications such as political campaigns, digital marketing, crowdfunding, and fact checking. Here a tensor representation for Twitter subgraphs is proposed which takes into consideration both the followfollower relationships but also the coherency in hashtags. Community structure discovery then reduces to the computation of Tucker tensor decomposition, a higher order counterpart of the well-known unsupervised learning method of singular value decomposition (SVD). Tucker decomposition clearly outperforms the SVD in terms of finding a more compact community size distribution in experiments done in Julia on a Twitter subgraph. This can be attributed to the facts that the proposed methodology combines both structural and functional Twitter elements and that hashtags carry an increased semantic weight in comparison to ordinary tweets.
DOI10.1109/SMAP49528.2020.9248469
Citation Keydrakopoulos_unsupervised_2020