Unsupervised Clickstream Clustering for User Behavior Analysis
Title | Unsupervised Clickstream Clustering for User Behavior Analysis |
Publication Type | Conference Paper |
Year of Publication | 2016 |
Authors | Wang, Gang; Zhang, Xinyi; Tang, Shiliang; Zheng, Haitao; Zhao, Ben Y. |
Conference Name | Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems |
Date Published | May 2016 |
Publisher | ACM |
Conference Location | San Jose, CA, USA |
ISBN Number | 978-1-4503-3362-7 |
Keywords | clickstream analysis, composability, edge detection, Metrics, pubcrawl, Resiliency, Scalability, security, user behavioral model, visualization |
Abstract | Online services are increasingly dependent on user participation. Whether it's online social networks or crowdsourcing services, understanding user behavior is important yet challenging. In this paper, we build an unsupervised system to capture dominating user behaviors from clickstream data (traces of users' click events), and visualize the detected behaviors in an intuitive manner. Our system identifies "clusters" of similar users by partitioning a similarity graph (nodes are users; edges are weighted by clickstream similarity). The partitioning process leverages iterative feature pruning to capture the natural hierarchy within user clusters and produce intuitive features for visualizing and understanding captured user behaviors. For evaluation, we present case studies on two large-scale clickstream traces (142 million events) from real social networks. Our system effectively identifies previously unknown behaviors, e.g., dormant users, hostile chatters. Also, our user study shows people can easily interpret identified behaviors using our visualization tool. |
URL | https://dl.acm.org/doi/10.1145/2858036.2858107 |
DOI | 10.1145/2858036.2858107 |
Citation Key | wang_unsupervised_2016 |
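
The abstract describes clustering users by partitioning a similarity graph whose nodes are users and whose edges are weighted by clickstream similarity. The following is a minimal sketch of that idea, not the authors' implementation. Several choices here are assumptions made only for illustration: click bigram counts as features, cosine similarity as the edge weight, and a simple threshold plus connected components standing in for the graph partitioning step. The iterative feature pruning mentioned in the abstract is not implemented, and all function names and the sample data are hypothetical.

```python
from collections import Counter
from math import sqrt


def ngram_counts(events, n=2):
    """Count the n-grams of consecutive click events in one user's clickstream."""
    return Counter(tuple(events[i:i + n]) for i in range(len(events) - n + 1))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors (Counter objects)."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def similarity_graph(clickstreams, n=2):
    """Return {(user_u, user_v): weight} for every pair with non-zero similarity."""
    feats = {user: ngram_counts(events, n) for user, events in clickstreams.items()}
    users = sorted(feats)
    edges = {}
    for i, u in enumerate(users):
        for v in users[i + 1:]:
            w = cosine(feats[u], feats[v])
            if w > 0:
                edges[(u, v)] = w
    return edges


def cluster_users(clickstreams, threshold=0.5, n=2):
    """Group users linked by edges with weight >= threshold.

    Assumption: thresholded connected components (via union-find) are a
    simplified stand-in for the graph partitioning named in the abstract.
    """
    parent = {user: user for user in clickstreams}

    def find(u):
        # Follow parent links to the root, halving the path on the way.
        while parent[u] != u:
            parent[u] = parent[parent[u]]
            u = parent[u]
        return u

    for (u, v), w in similarity_graph(clickstreams, n).items():
        if w >= threshold:
            parent[find(u)] = find(v)

    groups = {}
    for user in clickstreams:
        groups.setdefault(find(user), []).append(user)
    return sorted(sorted(members) for members in groups.values())


# Hypothetical sample data: two "browsing" users and two "chatting" users.
example = {
    "u1": ["login", "browse", "browse", "logout"],
    "u2": ["login", "browse", "logout"],
    "u3": ["login", "chat", "chat", "chat"],
    "u4": ["chat", "chat", "chat", "logout"],
}

if __name__ == "__main__":
    print(cluster_users(example))
```

On the sample data, the two browsing users share click bigrams with each other but none with the chatting users, so the thresholded graph splits into two groups. A real trace would need a more scalable similarity computation than this all-pairs loop.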