Visible to the public Unsupervised Clickstream Clustering for User Behavior Analysis

TitleUnsupervised Clickstream Clustering for User Behavior Analysis
Publication TypeConference Paper
Year of Publication2016
AuthorsWang, Gang, Zhang, Xinyi, Tang, Shiliang, Zheng, Haitao, Zhao, Ben Y.
Conference NameProceedings of the 2016 CHI Conference on Human Factors in Computing Systems
Date PublishedMay 2016
PublisherACM
Conference LocationNew York, NY, USA
ISBN Number978-1-4503-3362-7
Keywordsclickstream analysis, composability, edge detection, Metrics, pubcrawl, Resiliency, Scalability, security, user behavioral model, visualization
Abstract

Online services are increasingly dependent on user participation. Whether it's online social networks or crowdsourcing services, understanding user behavior is important yet challenging. In this paper, we build an unsupervised system to capture dominating user behaviors from clickstream data (traces of users' click events), and visualize the detected behaviors in an intuitive manner. Our system identifies "clusters" of similar users by partitioning a similarity graph (nodes are users; edges are weighted by clickstream similarity). The partitioning process leverages iterative feature pruning to capture the natural hierarchy within user clusters and produce intuitive features for visualizing and understanding captured user behaviors. For evaluation, we present case studies on two large-scale clickstream traces (142 million events) from real social networks. Our system effectively identifies previously unknown behaviors, e.g., dormant users, hostile chatters. Also, our user study shows people can easily interpret identified behaviors using our visualization tool.

URLhttps://dl.acm.org/doi/10.1145/2858036.2858107
DOI10.1145/2858036.2858107
Citation Keywang_unsupervised_2016