Visible to the public A Phishing Detection Method Based on Data Mining

TitleA Phishing Detection Method Based on Data Mining
Publication TypeConference Paper
Year of Publication2021
AuthorsLi, Chunzhi
Conference Name2021 3rd International Conference on Applied Machine Learning (ICAML)
Keywordsdata mining, Deep Learning, feature extraction, Forestry, Human Behavior, Learning systems, lexical features, Malicious URL, Network security, Neural networks, phishing, pubcrawl, Random Forest
AbstractData mining technology is a very important technology in the current era of data explosion. With the informationization of society and the transparency and openness of information, network security issues have become the focus of concern of people all over the world. This paper wants to compare the accuracy of multiple machine learning methods and two deep learning frameworks when using lexical features to detect and classify malicious URLs. As a result, this paper shows that the Random Forest, which is an ensemble learning method for classification, is superior to 8 other machine learning methods in this paper. Furthermore, the Random Forest is even superior to some popular deep neural network models produced by famous frameworks such as TensorFlow and PyTorch when using lexical features to detect and classify malicious URLs.
DOI10.1109/ICAML54311.2021.00050
Citation Keyli_phishing_2021