Visible to the public Named Entity Recognition in Cyber Threat Intelligence Using Transformer-based Models

TitleNamed Entity Recognition in Cyber Threat Intelligence Using Transformer-based Models
Publication TypeConference Paper
Year of Publication2021
AuthorsEvangelatos, Pavlos, Iliou, Christos, Mavropoulos, Thanassis, Apostolou, Konstantinos, Tsikrika, Theodora, Vrochidis, Stefanos, Kompatsiaris, Ioannis
Conference Name2021 IEEE International Conference on Cyber Security and Resilience (CSR)
KeywordsBERT, Computer crime, cyber threat intelligence, dark web, Deep Learning, DNRTI, ELECTRA, Feeds, Human Behavior, named entity recognition, pubcrawl, RoBERTa, social networking (online), Training, visualization, Web pages, XLNet
AbstractThe continuous increase in sophistication of threat actors over the years has made the use of actionable threat intelligence a critical part of the defence against them. Such Cyber Threat Intelligence is published daily on several online sources, including vulnerability databases, CERT feeds, and social media, as well as on forums and web pages from the Surface and the Dark Web. Named Entity Recognition (NER) techniques can be used to extract the aforementioned information in an actionable form from such sources. In this paper we investigate how the latest advances in the NER domain, and in particular transformer-based models, can facilitate this process. To this end, the dataset for NER in Threat Intelligence (DNRTI) containing more than 300 pieces of threat intelligence reports from open source threat intelligence websites is used. Our experimental results demonstrate that transformer-based techniques are very effective in extracting cybersecurity-related named entities, by considerably outperforming the previous state- of-the-art approaches tested with DNRTI.
DOI10.1109/CSR51186.2021.9527981
Citation Keyevangelatos_named_2021