Visible to the public Needle in a Haystack: Tracking Down Elite Phishing Domains in the Wild

TitleNeedle in a Haystack: Tracking Down Elite Phishing Domains in the Wild
Publication TypeConference Paper
Year of Publication2018
AuthorsTian, Ke, Jan, Steve T. K., Hu, Hang, Yao, Danfeng, Wang, Gang
Conference NameProceedings of the Internet Measurement Conference 2018
PublisherACM
Conference LocationNew York, NY, USA
ISBN Number978-1-4503-5619-0
KeywordsHuman Behavior, human factor, phishing, pubcrawl
Abstract

Today's phishing websites are constantly evolving to deceive users and evade the detection. In this paper, we perform a measurement study on squatting phishing domains where the websites impersonate trusted entities not only at the page content level but also at the web domain level. To search for squatting phishing pages, we scanned five types of squatting domains over 224 million DNS records and identified 657K domains that are likely impersonating 702 popular brands. Then we build a novel machine learning classifier to detect phishing pages from both the web and mobile pages under the squatting domains. A key novelty is that our classifier is built on a careful measurement of evasive behaviors of phishing pages in practice. We introduce new features from visual analysis and optical character recognition (OCR) to overcome the heavy content obfuscation from attackers. In total, we discovered and verified 1,175 squatting phishing pages. We show that these phishing pages are used for various targeted scams, and are highly effective to evade detection. More than 90% of them successfully evaded popular blacklists for at least a month.

URLhttps://dl.acm.org/doi/10.1145/3278532.3278569
DOI10.1145/3278532.3278569
Citation Keytian_needle_2018