Visible to the public Machine Learning Algorithms Evaluation for Phishing URLs Classification

TitleMachine Learning Algorithms Evaluation for Phishing URLs Classification
Publication TypeConference Paper
Year of Publication2021
AuthorsBOUIJIJ, Habiba, BERQIA, Amine
Conference Name2021 4th International Symposium on Advanced Electrical and Communication Technologies (ISAECT)
Date Publisheddec
Keywordsaccuracy metric, cyberattack, cybersecurity, feature extraction, features, Human Behavior, Lexical Analysis, machine learning, machine learning algorithms, Measurement, phishing, Phishing-URL, pubcrawl, Radio frequency, Support vector machines, Uniform resource locators
AbstractPhishing URL is a type of cyberattack, based on falsified URLs. The number of phishing URL attacks continues to increase despite cybersecurity efforts. According to the Anti-Phishing Working Group (APWG), the number of phishing websites observed in 2020 is 1 520 832, doubling over the course of a year. Various algorithms, techniques and methods can be used to build models for phishing URL detection and classification. From our reading, we observed that Machine Learning (ML) is one of the recent approaches used to detect and classify phishing URL in an efficient and proactive way. In this paper, we evaluate eleven of the most adopted ML algorithms such as Decision Tree (DT), Nearest Neighbours (KNN), Gradient Boosting (GB), Logistic Regression (LR), Naive Bayes (NB), Random Forest (RF), Support Vector Machines (SVM), Neural Network (NN), Ex-tra\_Tree (ET), Ada\_Boost (AB) and Bagging (B). To do that, we compute detection accuracy metric for each algorithm and we use lexical analysis to extract the URL features.
DOI10.1109/ISAECT53699.2021.9668489
Citation Keybouijij_machine_2021