Comment Spam Detection via Effective Features Combination
Title | Comment Spam Detection via Effective Features Combination |
Publication Type | Conference Paper |
Year of Publication | 2019 |
Authors | Li, Meng, Wu, Bin, Wang, Yaning |
Conference Name | ICC 2019 - 2019 IEEE International Conference on Communications (ICC) |
Date Published | may |
ISBN Number | 978-1-5386-8088-9 |
Keywords | blog spam dataset, Blogs, boosting, comment spam detection, Decision trees, Detectors, feature combination, feature extraction, gradient boosting tree algorithm, Human Behavior, human factors, Internet, Metrics, pubcrawl, Scalability, spam detection, trees (mathematics), user experience, Web sites |
Abstract | Comment spam is one of the great challenges faced by forum administrators. Detecting and blocking comment spam can relieve the load on servers, improve user experience and purify the network conditions. This paper focuses on the detection of comment spam. The behaviors of spammer and the content of spam were analyzed. According to analysis results, two types of effective features are extracted which can make a better description of spammer characteristics. Additionally, a gradient boosting tree algorithm was used to construct the comment spam detector based on the extracted features. Our proposed method is examined on a blog spam dataset which was published by previous research, and the result illustrates that our method performs better than the previous method on detection accuracy. Moreover, the CPU time is recorded to demonstrate that the time spent on both training and testing maintains a small value. |
URL | https://ieeexplore.ieee.org/document/8761340 |
DOI | 10.1109/ICC.2019.8761340 |
Citation Key | li_comment_2019 |