Topic Evolution Modeling in Social Media Short Texts Based on Recurrent Semantic Dependent CRP
Title | Topic Evolution Modeling in Social Media Short Texts Based on Recurrent Semantic Dependent CRP |
Publication Type | Conference Paper |
Year of Publication | 2017 |
Authors | Zhang, Y., Mao, W., Zeng, D. |
Conference Name | 2017 IEEE International Conference on Intelligence and Security Informatics (ISI) |
Date Published | jul |
Keywords | Adaptation models, Analytical models, composability, Data models, HDP, Human Behavior, human factors, LDA, Media, Metrics, pubcrawl, recurrent semantic dependent CRP, rsdCRP, Scalability, sdTEM, semantic dependent Chinese restaurant process, semantic similarity information, Semantics, short-text oriented topic evolution model, Social Media Analytics, social media short texts, Social network services, social networking (online), text analysis, text analytics, text mining, Tools, topic modeling, Twitter dataset, word co-occurrence modeling |
Abstract | Social media has become an important platform for people to express opinions, share information and communicate with others. Detecting and tracking topics from social media can help people grasp essential information and facilitate many security-related applications. As social media texts are usually short, traditional topic evolution models built based on LDA or HDP often suffer from the data sparsity problem. Recently proposed topic evolution models are more suitable for short texts, but they need to manually specify topic number which is fixed during different time period. To address these issues, in this paper, we propose a nonparametric topic evolution model for social media short texts. We first propose the recurrent semantic dependent Chinese restaurant process (rsdCRP), which is a nonparametric process incorporating word embeddings to capture semantic similarity information. Then we combine rsdCRP with word co-occurrence modeling and build our short-text oriented topic evolution model sdTEM. We carry out experimental studies on Twitter dataset. The results demonstrate the effectiveness of our method to monitor social media topic evolution compared to the baseline methods. |
URL | http://ieeexplore.ieee.org/document/8004885/ |
DOI | 10.1109/ISI.2017.8004885 |
Citation Key | zhang_topic_2017 |
- semantic dependent Chinese restaurant process
- word co-occurrence modeling
- Twitter dataset
- topic modeling
- tools
- Text Mining
- text analytics
- text analysis
- social networking (online)
- Social network services
- social media short texts
- Social Media Analytics
- short-text oriented topic evolution model
- Semantics
- semantic similarity information
- Adaptation models
- sdTEM
- Scalability
- rsdCRP
- recurrent semantic dependent CRP
- pubcrawl
- Metrics
- Media
- LDA
- Human Factors
- Human behavior
- HDP
- Data models
- composability
- Analytical models