Visible to the public A lightweight machine learning based security framework for detecting phishing attacks

TitleA lightweight machine learning based security framework for detecting phishing attacks
Publication TypeConference Paper
Year of Publication2021
AuthorsKumar, Yogendra, Subba, Basant
Conference Name2021 International Conference on COMmunication Systems & NETworkS (COMSNETS)
Keywordsfeature extraction, Human Behavior, kaggle dataset1, KNN, logistic regression, machine learning, Neural Network, phishing, phishing attacks, pubcrawl, Random Forest, Real-time Systems, security, SVM, Transforms, UNB dataset2, Uniform resource locators
AbstractA successful phishing attack is prelude to various other severe attacks such as login credentials theft, unauthorized access to user's confidential data, malware and ransomware infestation of victim's machine etc. This paper proposes a real time lightweight machine learning based security framework for detection of phishing attacks through analysis of Uniform Resource Locators (URLs). The proposed framework initially extracts a set of highly discriminating and uncorrelated features from the URL string corpus. These extracted features are then used to transform the URL strings into their corresponding numeric feature vectors, which are eventually used to train various machine learning based classifier models for identification of malicious phishing URLs. Performance analysis of the proposed security framework on two well known datasets: Kaggle dataset and UNB dataset shows that it is capable of detecting malicious phishing URLs with high precision, while at the same time maintain a very low level of false positive rate. The proposed framework is also shown to outperform other similar security frameworks proposed in the literature.121https://www.kaggle.com/antonyj453/ur1dataset2https://www.unb.ca/cic/datasets/ur1-2016.htm1
DOI10.1109/COMSNETS51098.2021.9352828
Citation Keykumar_lightweight_2021