Machine Learning Algorithms Evaluation for Phishing URLs Classification

Submitted by grigby1 on Wed, 10/12/2022 - 2:39pm

Title	Machine Learning Algorithms Evaluation for Phishing URLs Classification
Publication Type	Conference Paper
Year of Publication	2021
Authors	BOUIJIJ, Habiba, BERQIA, Amine
Conference Name	2021 4th International Symposium on Advanced Electrical and Communication Technologies (ISAECT)
Date Published	dec
Keywords	accuracy metric, cyberattack, cybersecurity, feature extraction, features, Human Behavior, Lexical Analysis, machine learning, machine learning algorithms, Measurement, phishing, Phishing-URL, pubcrawl, Radio frequency, Support vector machines, Uniform resource locators
Abstract	Phishing URL is a type of cyberattack, based on falsified URLs. The number of phishing URL attacks continues to increase despite cybersecurity efforts. According to the Anti-Phishing Working Group (APWG), the number of phishing websites observed in 2020 is 1 520 832, doubling over the course of a year. Various algorithms, techniques and methods can be used to build models for phishing URL detection and classification. From our reading, we observed that Machine Learning (ML) is one of the recent approaches used to detect and classify phishing URL in an efficient and proactive way. In this paper, we evaluate eleven of the most adopted ML algorithms such as Decision Tree (DT), Nearest Neighbours (KNN), Gradient Boosting (GB), Logistic Regression (LR), Naive Bayes (NB), Random Forest (RF), Support Vector Machines (SVM), Neural Network (NN), Ex-tra\_Tree (ET), Ada\_Boost (AB) and Bagging (B). To do that, we compute detection accuracy metric for each algorithm and we use lexical analysis to extract the URL features.
DOI	10.1109/ISAECT53699.2021.9668489
Citation Key	bouijij_machine_2021

Groups:

Science of Security VO