Detection of Hurriedly Created Abnormal Profiles in Recommender Systems
Title | Detection of Hurriedly Created Abnormal Profiles in Recommender Systems |
Publication Type | Conference Paper |
Year of Publication | 2018 |
Authors | Panagiotakis, C., Papadakis, H., Fragopoulou, P. |
Conference Name | 2018 International Conference on Intelligent Systems (IS) |
ISBN Number | 978-1-5386-7097-2 |
Keywords | anomalous rating profiles, attackers, clustering methods, dimensionality reduction, feature extraction, gaussian distribution, Hidden Markov models, human factors, hurriedly created abnormal profiles, information filtering, k-means clustering, Labeling, learning (artificial intelligence), Outliers, pattern clustering, profile injection attacks, pubcrawl, Random Forest, random ratings, recommender systems, resilience, Resiliency, Scalability, security of data, specific items, specific ratings, spurious profiles, synthetic coordinates, system ratings, user rating behavior, user-item rating matrix |
Abstract | Recommender systems try to predict the preferences of users for specific items. These systems suffer from profile injection attacks, where the attackers have some prior knowledge of the system ratings and their goal is to promote or demote a particular item introducing abnormal (anomalous) ratings. The detection of both cases is a challenging problem. In this paper, we propose a framework to spot anomalous rating profiles (outliers), where the outliers hurriedly create a profile that injects into the system either random ratings or specific ratings, without any prior knowledge of the existing ratings. The proposed detection method is based on the unpredictable behavior of the outliers in a validation set, on the user-item rating matrix and on the similarity between users. The proposed system is totally unsupervised, and in the last step it uses the k-means clustering method automatically spotting the spurious profiles. For the cases where labeling sample data is available, a random forest classifier is trained to show how supervised methods outperforms unsupervised ones. Experimental results on the MovieLens 100k and the MovieLens 1M datasets demonstrate the high performance of the proposed schemata. |
URL | https://ieeexplore.ieee.org/document/8710589 |
DOI | 10.1109/IS.2018.8710589 |
Citation Key | panagiotakis_detection_2018 |
- profile injection attacks
- user-item rating matrix
- user rating behavior
- system ratings
- synthetic coordinates
- spurious profiles
- specific ratings
- specific items
- security of data
- Scalability
- Resiliency
- resilience
- recommender systems
- random ratings
- Random Forest
- pubcrawl
- anomalous rating profiles
- pattern clustering
- Outliers
- learning (artificial intelligence)
- Labeling
- k-means clustering
- information filtering
- hurriedly created abnormal profiles
- Human Factors
- Hidden Markov models
- gaussian distribution
- feature extraction
- dimensionality reduction
- clustering methods
- attackers