Unveiling MIMETIC: Interpreting Deep Learning Traffic Classifiers via XAI Techniques

Submitted by grigby1 on Wed, 12/22/2021 - 12:54pm

Title	Unveiling MIMETIC: Interpreting Deep Learning Traffic Classifiers via XAI Techniques
Publication Type	Conference Paper
Year of Publication	2021
Authors	Nascita, Alfredo; Montieri, Antonio; Aceto, Giuseppe; Ciuonzo, Domenico; Persico, Valerio; Pescapè, Antonio
Conference Name	2021 IEEE International Conference on Cyber Security and Resilience (CSR)
Date Published	July 2021
Publisher	IEEE
ISBN Number	978-1-6654-0285-9
Keywords	Deep Learning, encrypted traffic, explainable artificial intelligence, Internet, law enforcement, learning (artificial intelligence), Limiting, mobile applications, Mobile handsets, Multimodal learning, pubcrawl, resilience, Resiliency, Scalability, Traffic classification, xai
Abstract	The widespread use of powerful mobile devices has deeply affected the mix of traffic traversing both the Internet and enterprise networks (with bring-your-own-device policies). Traffic encryption has become extremely common, and the quick proliferation of mobile apps and their simple distribution and update have created a specifically challenging scenario for traffic classification and its uses, especially network-security related ones. The recent rise of Deep Learning (DL) has responded to this challenge, by providing a solution to the time-consuming and human-limited handcrafted feature design, and better classification performance. The counterpart of the advantages is the lack of interpretability of these black-box approaches, limiting or preventing their adoption in contexts where the reliability of results, or interpretability of policies, is necessary. To cope with these limitations, eXplainable Artificial Intelligence (XAI) techniques have seen recent intensive research. Along these lines, our work applies XAI-based techniques (namely, Deep SHAP) to interpret the behavior of a state-of-the-art multimodal DL traffic classifier. As opposed to common results seen in XAI, we aim at a global interpretation, rather than sample-based ones. The results quantify the importance of each modality (payload- or header-based), and of specific subsets of inputs (e.g., TLS SNI and TCP Window Size) in determining the classification outcome, down to per-class (viz. application) level. The analysis is based on a publicly-released recent dataset focused on mobile app traffic.
URL	https://ieeexplore.ieee.org/document/9527948
DOI	10.1109/CSR51186.2021.9527948
Citation Key	nascita_unveiling_2021
